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ABSTRACT 



We describe the modern formalism, ideas and applications of the instanton calculus for gauge theories 
with, and without, supersymmetry. Particular emphasis is put on developing a formalism that can 
deal with any number of instantons. This necessitates a thorough review of the ADHM construction 
of instantons with arbitrary charge and an in-depth analysis of the resulting moduli space of solutions. 
We review the construction of the ADHM moduli space as a hyper-Kahler quotient. We show how 
the functional integral in the semi-classical approximation reduces to an integral over the instanton 
moduli space in each instanton sector and how the resulting matrix partition function involves various 
geometrical quantities on the instanton moduli space: volume form, connection, curvature, isometries, 
etc. One important conclusion is that this partition function is the dimensional reduction of a higher- 
dimensional gauged linear sigma model which naturally leads us to describe the relation of the instanton 
calculus to D-branes in string theory. Along the way we describe powerful applications of the calculus of 
many instantons to supersymmetric gauge theories including (i) the gluino condensate puzzle in = 1 
theories (ii) Seiberg-Witten theory in Af = 2 theories; and (iii) the AdS/CFT correspondence in M = 2 
and M = 4 theories. Finally, we brielfy review the modifications of the instanton calculus for a gauge 
theory defined on a non-commutative spacetime and we also describe a new method for calculating 
instanton processes using a form of localization on the instanton moduli space. 
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I Introduction 

Yang-Mills instantons [1] have provided an enduring interest for a generation of physicists and mathe- 
maticians. On the physics side, instanton configurations give the leading non-perturbative contributions 
to the functional integral in the semi-classical approximation [2]. After the initial disappointment that 
instantons were not going to provide a simple explanation of quark confinement, they have proved 
interesting and useful both in phenomenological models of QCD (see, for example [3-5]) and for de- 
scribing exact non-perturbative phenomena in supersymmetric gauge theories (as we describe in this 
review). On the mathematical side, instantons lie at the heart of some important recent developments 
in topology and, in particular, of Donaldson's construction of topological invariants of four-manifolds 
(see [6] and references therein). 

The main applications of instantons we will consider in this review are to supersymmetric gauge 
theories. Recent developments have provided an extraordinary web of exact non-perturbative results 
for these theories, making them an ideal field-theoretical laboratory in which to test our ideas about 
strongly-coupled gauge dynamics. Highlights include Seiberg duality for M = 1 theories, the Seiberg- 
Witten solution of Af = 2 supersymmetric Yang-Mills and the AdS/CFT correspondence. Typically 
supersymmetry constrains the form of quantum corrections allowing exact results to be obtained for 
some special quantities. Despite these constraints, supersymmetric gauge theories still exhibit many 
interesting physical phenomena including quark confinement and chiral symmetry breaking. Yang-Mills 
instantons have played an important role in some of these developments which we will review in the 
following. 

Instantons in supersymmetric gauge theories were studied in detail by several groups in the 1980's. 
This early work focused on the contribution of a single instanton in theories with J\f = 1 supersymmetry. 
Impressive results were obtained for the vacuum structure of these theories, including exact formulae 
for condensates of chiral operators. Although precise numerical answers were obtained, interest in 
these was limited as there was nothing with which to compare them. 1 The new developments motivate 

1 Note however, that several puzzles emerged on comparing numerical values for condensates obtained using different 
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the extension of the instanton calculus to theories with extended supersymmetry and, in particular, 
to instantons of arbitrary topological charge. This is the main goal of the first part of this review. 
Developments such as Seiberg-Witten theory and the AdS/CFT correspondence also yield precise pre- 
dictions for instanton effects, which can be checked explicitly using the methods we will develop below. 
These applications, which are described in the second half of the review are important for two reasons. 
Firstly, they provide as a quantitative test of conjectural dualities which underlie the recent progress in 
supersymmetric gauge theories. Secondly, they also increase our confidence in weak-coupling instanton 
calculations and some of the technology which they require such as Wick rotation to Euclidean space, 
constrained instantons, etc. 

A semi-classical evaluation of the path integral requires us to find the complete set of finite-action 
configurations which minimize the Euclidean action. In pure Yang-Mills theory, this was accomplished 
many years ago in pioneering work by Atiyah, Drinfeld, Hitchin and Manin (ADHM) [7]. In partic- 
ular, these authors found the complete set of self-dual gauge fields of arbitrary topological charge k. 
Their construction, which works for arbitrary SU(iV), SO(iV) or Sp(iV) gauge groups (but not for the 
exceptional groups), reduces the self-dual Yang-Mills equation to a set of non-linear algebraic equations 
(the "ADHM constraints") constraining a matrix of parameters (the 'ADHM data"). After modding 
out a residual symmetry group, each solution of the ADHM constraints defines a gauge-inequivalent, 
fc-instanton configuration. The space of such solutions, which we denote Tlk, is also known as the 
/c-instanton moduli space. 

The space ffllk, has many remarkable properties which will play a central role in our story. Firstly, 
apart from having some isolated singularities, it is a Riemannian manifold with a natural metric. The 
singularities are relatively mild, being of conical type, and have a natural interpretation as points where 
instantons shrink to zero size. In addition, the moduli space carries families of inequivalent complex 
structures and, when endowed with its natural metric, defines a hyper-Kahler manifold. This property is 
manifest in the ADHM construction, which is actually an example of a general method for constructing 
hyper-Kahler spaces known as the hyper-Kahler quotient. This viewpoint turns out to be very useful 
and we will find that many of the structures which emerge in the semi-classical analysis of the path 
integral have a nice geometrical interpretations in terms of the hyper-Kahler quotient. 

Applying the semi-classical approximation to a Green's function requires us to replace the fields 
by their ADHM values and then integrate over the moduli space Wlk- Constructing the appropriate 
measure for this integration, both in pure gauge theory and its supersymmetric generalizations, is the 
main problem we address in the first part of the review. The main obstacle to overcome is the fact the 
ADHM constraints cannot generally be solved except for small values of k. Hence, we cannot find an 
explicit unconstrained parameterization of the moduli space. However, we will be able to find explicit 
formulae for the measure in terms of the ADHM data together with Lagrange multiplier fields which 
impose the ADHM equations as 5-function constraints. Under certain circumstances we will be able 
to evaluate the the moduli space integrals even for general k. For instance at large-iV the ADHM 
constraints can be solved, at least on a certain generic region of the moduli space, and this means that 
the instanton calculus becomes tractable for arbitrary k. In the final Chapter we shall briefly explain 
approaches to instanton calculations. These will be re-examined in Chapter IV 
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important new developments that allow one to make progress even at finite N when one is on a Higgs 
or Coulomb branch. 

The plan of this report is as follows. As a prelude we briefly review the basic philosophy of instanton 
calculations. Chapter II provides a solid grounding of the ADHM construction of instantons in SU(iV) 
gauge theory. 2 Firstly, we describe some general features of instantons. This begins with the notion of 
zero modes in the instanton background, collective coordinates and the moduli space of instantons DJlk- 
Very general arguments then show that 971^ is a hyper-Kahler manifold with a natural Hermitian metric. 
After this, we describe all aspects of the ADHM construction of fJJtk, putting particular emphasis on the 
hyper-Kahler quotient description. Most of the material in Chapter II is necessarily very mathematical. 
A reader primarily interested in physical applications of instantons can skip most of §11.3.1, §11.4.1, 
§11.5 and §11.6. 

Chapter III, shows how the semi-classical limit of the functional integral reduces to an integral over 
DJlk with a volume form which is derived from the Hermitian metric constructed previously. However, 
in the setting of pure gauge theory, integrating out all the non-zero modes of the quadratic fluctuations 
around the instanton yields a non-trivial determinant factor which accounts for the one-loop effects in 
perturbation theory in the instanton background. We briefly describe, using results reviewed in [8], 
how the determinants may be evaluated in the ADHM background, although the resulting expressions 
are rather unsatisfactory (involving spacetime integrals). This latter point will not overly concern 
us, because in the supersymmetric applications the non-trivial parts of the determinant factors cancel 
between bosons and fermions. 

Chapter IV describes how the instanton calculus is generalized in a supersymmetric gauge theory. 
This is done in the context of M = 1, 2 and 4 supersymmetric theories with no additional matter 
fields. (Adding matter fields is considered separately in §VI.3.) First of all, we explain how to construct 
the Euclidean version of the theory by Wick rotation from Minkowski space and how this inevitably 
leads to a theory where the fermion action is not real. This is a fact of life in the supersymmetric 
instanton calculus, just as it is when fermionic theories are investigated on the lattice, but does not 
lead to any inconsistencies or pathologies. In an instanton background, there are fermion zero modes 
and their associated Grassmann collective coordinates. We show how these are superpartners of the 
bosonic collective coordinates and so in a supersymmetric gauge theory the moduli space dJlk is, itself, 
supersymmetrized. We show how this fits in with the hyper-Kahler quotient approach. In this section, 
we explain in detail the concept of the super-instanton. In particular, in the M = 4 theory, the 
appropriate supersymmetric instanton solution is only an approximate solution of the equations-of- 
motion, but one which captures the leading order semi-classical behaviour of the functional integral. 
We also explain in this section how the addition of VEVs for the scalar fields (for the M = 2 and 
4 theories) leads to Affleck's notion of a constrained instanton. Our point-of-view is that constrained 
instantons are — in a sense — only a mild modification of the conventional instanton: some of the collective 
coordinates cease to be exact moduli and the whole effect can be described by the turning on a potential 
on 9Jt fc . We call this approximate solution a "quasi-instanton" . Chapter V goes to describe the semi- 
classical approximation in a supersymmetric gauge theory. In all cases we can formulate the leading- 
2 The ADHM construction for the other classical groups is described separately in §VI.2. 
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order semi-classical approximation to the functional integral as an integral over the supersymmetric 
version of Wlk- 

Some generalizations, and other important miscellany, are collected in Chapter VI. In particular, 
we explain how the ADHM constraints can be solved on generic orbit of the gauge group when N > 2k; 
how to extend the ADHM construction to gauge groups SO(iV) and Sp(iV); how to add matter fields 
transforming in the fundamental representation; the effect of adding masses which break various amounts 
of supersymmetry; and how to define the notion of the instanton partition function. 

Chapters VII-IX each describe an application of the multi-instanton calculus. In Chapter VII we 
review the two distinct approaches to instanton calculus in Af = 1 theories developed by different groups 
in the 1980's. One approach follows closely the methodology developed in the first part of the report, 
working with constrained instantons in a weakly-coupled Higgs phase. As we review, this approach 
yields explicit agreement with the web of exact results which predict a precise numerical value for the 
gluino condensate in Af = 1 supersymmetric Yang-Mills. The second, more controversial approach, 
attempts to evaluate the gluino condensate directly in the strongly-coupled confining phase. We cast 
further doubt on this method, by showing explicitly that the resulting formulae violate the clustering 
property, a general axiom of quantum field theory, and therefore cannot yield the exact answer. This 
is just as one might have expected: instantons are a semi-classical phenomena and as such are not 
expected to yield quantitatively exact results in a strongly-coupled phase. 

We then go on in Chapter VIII to consider instanton contributions to the low-energy effective action 
of M = 2 gauge theories on their Coulomb branch. These calculations are very important because 
they can be compared with a completely different approach based on the celebrated theory of Seiberg 
and Witten. Perfect agreement is found providing strong evidence in favour of the instanton calculus — 
including all this entails like the imaginary time formalism and the resulting saddle-point approximation 
of the functional integral — but also in favour of Seiberg and Witten's ingenious theory. Several minor 
modifications of the original Seiberg- Witten solution for Af = 2 theories with matter are also described. 
These involve ambiguities in the definition of parameters appearing in the exact solution. 

In Chapter IX we consider conformal, or finite, gauge theories with Af = 2 or Af = 4 supersymmetry 
in their non-abelian Coulomb phase. Here, we show how various instanton effects for arbitrary instanton 
charge can be evaluated exactly in the limit of large N. The large- N instanton calculus that we develop 
provides substantial evidence in favour of the remarkable AdS/CFT correspondence which relates the 
gauge theories to ten-dimensional superstring theory on a particular background. In particular we 
will describe how, even at weak coupling, the large-A" instanton calculus probes the background ten- 
dimensional spacetime geometry of the string theory directly. 

In Chapter X we describe how instantons can be embedded in higher-dimensional theories as brane- 
like solitons. We show in detail, as expected from general principles, the resulting collective coordinate 
dynamics involves a certain a-model on the brane world-volume with the moduli space of instantons as 
the target space. This provides a useful way to understand the relationship of instantons to D-branes in 
superstring theory, a subject we go on to describe. Studying instantons in the context of string theory 
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provides a powerful way to derive many aspects of the instanton calculus developed in the previous 
sections and in the process removes much of the cloak of mystery surrounding the ADHM construction. 

Finally, in Chapter XI we describe two recent developments in the instanton calculus. Firstly, how 
the instanton is modified when the underlying gauge theory is defined on a non- commutative space. 
It turns out that the instanton moduli space is modified in a particularly natural way. Secondly, we 
describe a new calculational technique which promises to make instanton effects tractable even at finite 
N and for all instanton charge. The key idea is that in the presence of scalar field VEVs, a potential 
develops on the instanton moduli space and the collective coordinate integrals localize around the critical 
points of this potential. 

Appendix A details our conventions for spinors in different spacetime dimensions. Appendix B 
includes an introduction to hyper-Kahler geometry. Appendix C reviews some useful identities for 
calculations involving the ADHM data. 

There are many other excellent reviews of instanton physics including [4,9-13]; however these reviews 
concentrate on the single instanton. 

1.1 The philosophy of instanton calculations 

Before we begin in earnest, it is perhaps useful to remind the reader of the basic philosophy of instanton 
calculations. In quantum field theory, all information about the physical observables (i.e. the spectrum 
and the ^-matrix) can be obtained by calculating the correlation functions of appropriate operators. 
These correlation functions are defined by the Feynman path integral. The most convenient formulation 
is one where the correlation functions and the path integral are analytically continued to Euclidean 
spacetime. The path integral was first introduced as a formal generating functional for perturbation 
theory. However, thanks to the work of Wilson, it is widely believed that the Euclidean path integral 
actually provides a first-principles non-perturbative definition of quantum field theory. Implementing 
this definition in practice involves replacing continuous spacetime by a finite number of points and leads 
to the subject of lattice field theory which is beyond the scope of this article. Nevertheless, the success 
of this viewpoint inspires us to take even the continuum path integral seriously. 

In continuum quantum field theory we are generally limited to calculating at weak coupling. In four 
dimensions, the only exceptions are certain supersymmetric gauge theories which we will discuss further 
below. If the continuum path integral makes any sense at all, then the very least should expect is that 
it should yield sensible calculable answers at weak coupling. As mentioned above, the Feynman rules 
of ordinary weak-coupling perturbation theory can easily be derived from the path integral. However, 
even at weak coupling, the path integral contains much more information, including effects which are 
non-perturbative in the coupling constant. 
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In the simplest cases, the generating functional in Euclidean spacetime takes the schematic form, 



where $ denotes some set of fields with sources J and S[<&\ is the Euclidean action which is real 
and bounded below. The gauge theories we consider below are characterized (classically) by a single 
dimensionless coupling, denoted g 2 , and one may always re-scale the fields so that the coupling appears 
in front of the action as indicated in (1.1). Of course, in the quantum theory, we will certainly need to 
modify our discussion to account for the running of the coupling but we will, for the moment, postpone 
this discussion. 

The basic idea of the semi-classical approximation 3 is that, for small g 2 , the path integral is domi- 
nated by the configurations of lowest Euclidean action and we may proceed by expanding around these 
configurations. The simplest such configurations are the perturbative vacua of the theory (i.e. minima 
of the classical potential) and the corresponding expansion is just the ordinary loop expansion. How- 
ever the Euclidean action may have other minima with finite action. Such configurations are known 
as instantons the same logic dictates that we should also expand in fluctuations around them. For an 
instanton of finite Euclidean action S c \, the leading semi-classical contribution goes like exp(— S c \/g 2 ) 
and expanding in fluctuation leads to corrections which are suppressed by further powers of g 2 . 

As we will see, there are many extra complications and technical difficulties in carrying out this 
program in practice. As mentioned above, for asymptotically free theories of direct physical interest the 
coupling g 2 runs as a function of the energy and becomes large in the infra-red at the dynamical scale A. 
In this case, the fluctuations of the fields in the path integral become large and neither the instanton nor 
any other classical field configuration determines the physics. It seems therefore that instantons have 
little to tell us directly about these theories. On the other hand there are two types of theory we will 
meet where instanton methods are directly applicable. Firstly, and most straightforwardly, there are 
theories like J\f = 4 supersymmetric Yang-Mills theory where the /5-function vanishes and the coupling 
does not run. In this case we may set g 2 <C 1 and safely apply semi-classical methods. The second type 
of theory is asymptotically free but also contains scalar fields which can acquire a vacuum expectation 
value (VEV) spontaneously breaking the gauge group. The Higgs mechanism then cuts off the running 
of the coupling in the IR. If the mass scale of the VEV is much large than the dynamical scale A, the 
coupling small at all length scales and semi-classical reasoning is valid. 

In the case of a non-zero scalar VEV, we will also have to contend with problems related to the 
fact that instantons are no longer true minima of the action. In fact, both types of theory described 
above requires a more complicated application of the semi-classical method than simply expanding 
around exact solutions of the equations-of-motion. We will frequently solve the classical equations only 
approximately order-by-order in g 2 . However, the basic philosophy remains the same: as long as the 
theory is genuinely weakly coupled, an appropriate semi-classical approximation to the path integral 
should be reliable and we can use it to evaluate the leading non-perturbative corrections to observables. 
In the following chapters we will develop the necessary technology to accomplish this goal. 

3 The term "semi-classical" originates in the observation that g 2 appears in the exponent of (1.1) in exactly the same 
position as Planck's constant h would if we restored physical units. Thus the g 2 — > limit is identical to the h — > limit. 
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II Instantons in Pure Gauge Theory 



In this Chapter we describe instanton solutions of pure gauge theory, i. e. without any additional fields 
present. To begin with we introduce the central concept of the moduli space of instantons and describe 
how this space has a lot structure: it is a hyper-Kahler manifold with singularities. In §11.4 we describe 
the Atiyah, Drinfeld, Hitchin and Manin construction of arbitrary instanton solutions and then explain 
how the "ADHM construction" can be viewed as a particular example of the hyper-Kahler quotient 
construction. This point-of-view is necessarily rather mathematical and some of the background required 
is reviewed in Appendix B. 



II. 1 Some basic facts 

We start with pure SU(iV) gauge theory described by a Euclidean space action 

S[A] = J d 4 xti N F mn + i9k , (2.1) 
where the field strength F mn = d m A n — d n A m + g[A rn , A n ] and the topological charge is 

2 r 

k = / d 4 xtr N F mn *F mn e Z (2.2) 

with *F mn = \e mn kiFki. In these conventions, characteristic of the mathematical instanton literature, 
the gauge field A m is anti-Hermitian and so the covariant derivative is D m = d m + gA m . 

Instantons are the finite action solutions of the classical equations-of-motion which consequently 
satisfy a first-order equation. From the inequality 



d 4 xtr N (F mn ±*F mn ) <0, (2.3) 
one establishes a lower bound on the real part of the action: 



1 ^2 - 87T 2 



, , d 4 xtT N F^ n >— \k\ , (2.4) 
z J 9 

with equality when the gauge field satisfies the the self-dual, for k > 0, or anti-self-dual, for k < 0, 
Yang Mills equations: 

Fmn = ~^mnklF} t l zizF mn . (2.5) 

In our convention, instantons are the self-dual solutions and carry positive topological (instanton) charge 
k > 0. In contrast, anti-instantons satisfy the anti-self-dual Yang-Mills equations and carry negative 
topological charge k < 0. Their action is 

5= — LI + ike = I 2.6 
g 2 I -2inkr* k < , v 1 
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where we have defined the complex coupling 

T = ^ + r«- (2J ) 

In this review we will discuss exclusively instantons, rather than anti-instantons. The fundamental 
problem then, and the main subject of §11.4, is to find all the solutions to (2.5) with a + sign. When 
discussing the instanton calculus it is very convenient to introduce a quaternionic notation for four- 
dimensional Euclidean spacetime. The covering group of the SO (4) Euclidean Lorentz group in four 
dimensions is SU(2) l x SU(2)#. A 4- vector x n can be rewritten as a (2, 2) of this group with components 
x a a (or x aa ). Here, a, a = 1,2 are spinor indices of SU(2)l and SU(2)#, respectively. The explicit 
relation between the two bases is 

where a na a are four 2x2 matrices a n = (if, 1[ 2 ] X [2])> where r c , c = 1, 2, 3 are the three Pauli matrices. 
In addition we define the Hermitian conjugate matrices a n = a\ l = (—if, 1[ 2 ] X [2]) with components cr"". 4 
As explicit 2 x 2-dimensional matrices 

ix 3 + x A ixi +x 2 \ _«„ = f-ix 3 + x 4 -ixi - x 2 \ ^ g x 

ixi — x 2 —ix 3 + x 4 J ' ix\ + x 2 ix 3 + x 4 J 

Notice that derivatives are defined in the same way: 

d aA = <W9„ , d™ = G™d n . (2.10) 

But note with this definition d a a ^ d/dx a a- 
We now introduce the Lorentz generators, 

which are, respectively, self-dual and anti-self-dual: 

&mn = ^mnkl^kl i ^mn = ^mnkl^kl ■ (2-12) 

In terms of these, we write down, in regular gauge, the explicit one- instanton solution of the SU(2) 
gauge theory, known as the BPST instanton [1], as 

a _ ^-i ^Qe ~ A) m cr mn 
(x — xy + p l 

Note the identification SU(2) gauge indices and those of SU(2)x introduced above, reflecting the fact 
that the instanton configuration breaks the product of these groups down to a diagonal subgroup. The 
instanton gauge-field above depends on five parameters: one scale-size p and the four-vector instanton 
position X m . Performing global SU(2) gauge-rotations of the right hand side of (2.13) the total number 
of free parameters of a single-instanton solution becomes eight. This is the first example of the emergence 

4 Notice that indices are raised and lowered with the e-tensor defined in Appendix A: x aa — e a/3 e a ^x f3 0. 
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of instanton collective coordinates. We will see that in general a /c-instanton solution in the SU(iV) 
theory will contain 4kN collective coordinates (for the SU(2) theory and k = 1 we have 4kN = 8). It 
is straightforward to see that the corresponding field-strength 

^ V$W ' (2 - 14) 

is self-dual. The anti-instanton is obtained from expressions above via a substitution a mn — * a mn . 

Note that the BPST instanton (2.13) is a non-singular expression which falls off at large distances 
as 1/x. This slow fall off would make it difficult to construct square-integrable quantities involving 
instanton gauge fields (see [2] for more detail). An elegant and straightforward resolution of this tech- 
nical problem is to gauge-transform the regular instanton (2.13) with a singular gauge transformation 
U(x) = (J m [x — X) m /\x — X\. The resulting expression for the instanton in singular gauge is 

a _ „-i 2p 2 (x — X) m a mn 

n ~ 9 (x-xy((x-xy + p i) ■ (2 - 15j 

Note in singular gauge the SU(2) gauge indices are identified with SU(2)r indices in contrast to the 
expression in regular gauge. This expression falls off as 1/x 3 at large distances which improves the 
convergence of various integrals. 5 Hence, from now on we will always assume that all the multi-instanton 
solutions we are dealing with are written in singular gauge. The price to pay for this is the apparent 
singularity of the solution (2.15) at the instanton centre x m = X m . However, this singularity is not a 
problem, since it is, by construction, removable by a gauge transformation. A rigorous way of dealing 
with a singular gauge transformation is to introduce it on a punctured Euclidean space with the singular 
point (s) being removed. Then the singular-gauge instanton remains regular on the punctured space. 
Note that the punctures contribute to the boundary, hence the integrals of total derivatives, such as 
the instanton charge (2.2), will receive contributions from these punctures. 



II. 2 Collective coordinates and moduli space 

One of the key concepts associated to instantons (and more generally to solitons) is the idea of a 
moduli space of solutions. For instantons, this is the space of inequivalent solutions of the self-dual 
Yang-Mills equations (2.5). Since we are dealing with a gauge theory, the word "inequivalent" requires 
some clarification. It is most convenient to think of "inequivalent" as being equivalence up to local 
gauge transformations. So, for instance, solutions differing by global gauge transformation will be 
deemed inequivalent to each other. The reason for the distinction between local and global gauge 
transformations is discussed in [9]. In the present context the main point is simply that the usual 
covariant gauge fixing condition does not fix global gauge transformations and hence we must still 
integrate over the corresponding orbits in the path integral. If one is calculating a correlation function 
of gauge-invariant operators, this integral simply leads to an additional factor of the volume of the 
gauge group. 

5 The fast fall off of the singular gauge is also required for applying the LSZ reduction formulae to various Green 
functions in the instanton background. 
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It will turn out that the moduli space of instantons has a lot of mathematical structure, but for the 
moment, there are two properties that are paramount. Firstly, since finite action classical solutions of 
gauge theories on four-dimensional Euclidean space are classified by the topological (instanton) charge 
(2.2), the moduli space must contain distinct components describing the inequivalent solutions for each 
topological charge k. When the gauge group is SU(iV) (or U(iV) since the abelian factor makes no 
difference to the instanton solutions on a commutative Euclidean space) we will denote the moduli 
space of instantons with topological charge k as VJl k - The second property that these moduli spaces 
have is that they are manifolds, a fact which is not a priori obvious. Strictly speaking, as we shall see 
later, they have conical-type singularities (which occur physically when instantons shrink to zero size) 
and so we will use the term manifold in a slightly looser sense to encompass spaces with these kinds 
of features. We shall also see that Wl k has even more structure: it is a complex manifold of a very 
particular type known as a hyper-Kahler manifold. 

Since the moduli space is a manifold, albeit with singularities, we can introduce local coordinates 
to label its points. The coordinates on the moduli space label various collective properties of the gauge 
field and are therefore called collective coordinates. So the gauge fields of the instanton A n (x; X) depend 
not only on the coordinates on M. 4 , x n , but also on a set of collective coordinates that we denote X M , 
/i — 1,2, . . . ,dim9Jtfe. Some of the collective coordinates have an obvious physical interpretation; for 
instance, if we have have a given instanton solution, then since the solution is localized in M 4 it has a 
definable notion of centre. We can obviously translate the whole configuration in M 4 and so there must 
be collective coordinates that specify the position of the centre which we denote as X n , n = 1,2,3,4. 
Notice that, by symmetry, the gauge fields can only depend on these coordinates through the difference 
x n — X n . To reflect this, the moduli space 9Jl k is a product 



The component fXJlk, with the centre factored out, is known as the centred moduli space. 

Notice that the collective coordinates X n arise because, by its very nature, a given instanton solution 
breaks the translational symmetry of the theory. Another way to say this is that the subspace spanned 
by the collective coordinates X n around a given point in the moduli space can be generated by acting 
on the instanton solution with the group elements of the broken symmetry. In this case the symmetry 
in question are translations in spacetime. So if T x = —X n d/dx n are the generators of translations, then 



This is all quite trivial for the translational collective coordinates; however, it illustrates a general 
principle: there are collective coordinates associated to all the symmetries of the gauge theory that 
are broken by a given instanton solution. However, not all symmetries of the gauge theory give in- 
equivalent collective coordinates and not all collective coordinates correspond to broken symmetries: 
indeed typically the majority do not. Another way to phrase this is that the symmetries of the classical 
equations-of-motion are realized as symmetries of the moduli space and different symmetries may sweep 
out the same subspace of the moduli space while some symmetries may act trivially on (leave invariant) 
the moduli space. Finally, there may be some directions in the moduli space, and typically the majority, 
that are not related to any symmetry. 



Tl k = M 4 x Wl k . 




A n (x; X, . . .) = e Tx A n (x; 0, . . .) = A n {x - X; 0, . . .) . 



(2.17) 
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We now describe the symmetries of our theory. First of all, we have spacetime symmetries including 
the Poincare symmetry of four-dimensional Euclidean space. However, classical gauge theory is actu- 
ally invariant under the larger group that includes conformal transformations. In all, this group has 
fifteen generators which includes four translations, six rotations, four special conformal transformations 
generated by 

5x n = 2x n (x ■ e) — e n x 2 , (2-18) 

where e n is an infinitesimal four- vector, and dilatations 5x n = ax n . In quaternionic language where the 
vector x n is represented as the 2x2 matrix x a a in (2.9), the action of the whole conformal group can 
be written elegantly as 



x 



x = (Ax + B)(Cx + D)- 1 , det ^) = 1 , (2.19) 



where A, B, C and D are quaternions. Notice that there are fifteen variables in the transformation 
(2.19), matching the dimension of the conformal group. As well as these spacetime symmetries, there 
are global gauge transformations in our chosen gauge group SU(iV). Some of these symmetries will, 
like translations, be broken by an instanton solution, and consequently be realized non-trivially on the 
moduli space. 

Another key concept that is related to idea of a collective coordinate is the notion of a zero mode. 
Suppose that A n (x) is an instanton solution. Consider some small fluctuation A n (x) + 5A n (x) around 
this solution which is also a solution of the self-dual Yang-Mills equations (2.5). To linear order 

V m 5A n - V n 5A m = e mnk {D k 8Ai . (2.20) 

There are actually three independent equations here, manifested by writing them in quaternionic form: 

r^^ a 5A a6l = 0, (2.21) 

where r are the three Pauli matrices and we define the covariant derivatives in the quaternionic basis 

as 

Tp = a n V n , f = a n V n . (2.22) 

We must also fix the gauge in some way to weed out the variations that are just local gauge transfor- 
mations rather than genuine physical variations of the instanton solution. This is conveniently achieved 
by demanding that all fluctuations, including the zero modes, are orthogonal to gauge transformations. 
Here, orthogonality is defined in a functional sense by means of the inner product on adjoint- valued 
(anti-Hermitian) variations 

(5A n ,5'A n ) = -2 J d 4 x tr N 5A n (x)5'A n (x) . (2.23) 

Implicitly, we are discussing fluctuations which are square-integrable with respect to this inner product. 
So if 5A n is orthogonal to a gauge transformation, f d 4 xtr N V n Q 5A n = 0, then by integrating-by-parts 

V n 5A n = . (2.24) 

Again it is quite instructive to write this in quaternionic form: 

W a 5A a6l = . (2.25) 
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The conditions for a zero mode, (2.21) and (2.25), can then be written as a single quaternion equation: 

f &a 5A a$ = . (2.26) 

We recognize this as the covariant Weyl equation for a Weyl spinor ip a = SA a/ j in the instanton 
background. Due to the free (5 index each gauge zero mode actually corresponds to two independent 
solutions of the Weyl equation. 

The fluctuations SA n (x) satisfying (2.20) and (2.24), or (2.26), are then "zero modes" in the sense 
that they represent physical fluctuations in field space which do not change the value of the action. 
Non-zero mode fluctuations necessarily increase the action of the instanton solution. To quadratic order 
around the instanton solution, one finds 



(2.27) 



S[A] = -2-uikr + J d 4 xtr N SA m (V 2 S mn + 2gF mn )5A n + 
= -2nikr -^Jd A x tr N SA^A^JSA^ + ■■■ , 

where we have defined the fluctuation operator A^ + \ and, for future use, a companion A(-) : 

A (+) = -ff = -l [2]x[2] V 2 - gF mn a mn , (2.28a) 
A (-) = -fp= -1 [2]X[2] V 2 - gF mn a mn . (2.28b) 

The equality between the two expressions in (2.27) is established by using the fact that a mn is a projector 
onto self-dual tensors (A. 9). In addition, in an instanton (as opposed to an anti-instanton) background, 
the second term in A^ - ) vanishes; hence 

A ( -) = -l [2]x[2] V 2 . (2.29) 

The two operators A^) play an important role in governing the behaviour of fluctuations around the 
instanton in the semi-classical approximation. A important result is that in an instanton (self-dual) 
background A^ + ) has zero modes, whereas, by virtue of (2.29), A^ - ) is a positive semi-definite operator 
and therefore has no normalizable zero modes. 

The zero modes are associated to the collective coordinates in the following way. Consider the 
space of solutions A n (x; X) where x n are the Euclidean spacetime coordinates and are the collective 
coordinates. The derivative of the gauge field with respect to a collective coordinate, dA n /dX^, is 
guaranteed to satisfy the zero mode equation (2.20): in other words it is a potential zero mode. In 
order to ensure that it is a genuine zero mode, we have to satisfy the gauge condition (2.24). The way 
to achieve this is to notice that (2.20) is trivially satisfied by a gauge transformation 5A n = D n Q, for 
some function Q in the Lie algebra of the gauge group. We can then consider a linear combination of 
the derivative by the collective coordinate and a compensating gauge transformation: 

5,A n (x; X) = dA j^ X) - VM,(x) . (2.30) 

The parameter of the gauge transformation Q^(x) is chosen in order that (2.24) is satisfied by 
hence 
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The quantity 5^A n is then the "genuine" zero mode associated to X^ 1 . For the case of instantons, as 
long as we work in a special gauge known as singular gauge (as in Eq. (2.15)), there are no subtleties 
associated with normalizability and 5^A n are all square integrable. This rather convenient choice of 
gauge will be described in §11.4.3. Subsequently in §11.5 we will show in singular gauge that <5 M A n is 



We have seen that all collective coordinates are associated to zero modes. However, when gauge fields 
couple to other fields, it can happen that instanton zero modes are no longer associated to collective 
coordinates because they can fail to integrate from solutions of the linearized problem to solutions of 
the full coupled equations. Often we find that zero modes of the linearized problem are lifted by higher 
order interactions or by external interactions (for instance when scalar fields have VEVs). In these 
circumstances it is still useful to introduce the notion of quasi- collective coordinates which are lifted by 
a non-trivial effective action. The quasi-collective coordinates give rise to the notion of a quasi-instanton 
which only satisfies the classical equation-of-motion up to a certain order in the coupling. 

II. 3 General properties of the moduli space of instantons 

The moduli space of instanton solutions plays a central role in our story so it is important to describe 
in some detail how one arrives at a description of it. The story is necessarily rather mathematical, but 
the final answer, known as the ADHM construction, after Atiyah, Drinfeld, Hitchin and Manin [7] is 
a great mathematical achievement which we will review in §11.4. The ADHM moduli space has a lot 
of structure that can be deduced from general considerations. For example, the dimension of 971^ can 
be obtained by simply counting the number of zero modes at a point in the moduli space using the 
Atiyah-Singer Index Theorem [14] (for applications to instantons see [15]). The result is that 9Jtfc has 
real dimension AkN . 

II. 3.1 The moduli space as a complex manifold 

We have argued that the moduli space 971^ is a space with dimension 4kN. In fact it is a Riemannian 
manifold endowed with a natural metric defined as the functional inner product of the zero modes (in 
singular gauge), 6 



This metric plays an important role in the theory since, as we establish in §111, it defines the volume 
form on 9Jt fe that arises from changing variables in the path integral from the gauge field to the collective 
coordinates. However, we shall find that various other quantities that are derived from the metric, like 
the connection and curvature, also have an important role to play in the instanton calculus. 

In fact there is more structure to the moduli space than simply the existence of a metric. It turns 
out that it is also a complex manifold of a very particular kind; namely a hyper-Kahler space (with 

6 The factor of g 2 is inserted here so that the metric is independent of the coupling. 





(2.32) 
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singularities). A short review of some relevant aspects of such spaces is provided in Appendix B. 
Fundamentally, these spaces admit 3 linearly independent complex structures c = 1,2,3, that 
satisfy the algebra 

j(c)j(d) = _ S cd + e cdej{e) _ ^33) 

— * 

Often we will represent them as a three-vector i\ The key idea for constructing the triplet of complex 
structures on arises from noticing that Euclidean spacetime is itself hyper-Kahler. The three 
complex structures of M 4 can be chosen so that = —ff mn -, where ff mn is a 't Hooft r/-symbol defined 
in Appendix A. In the quaternion basis x^a 

(I ■ xU = ix a$ r^ , (J • xf a = -if^x^ . (2.34) 

These descend to give the three complex structures on 07t fc in the following way. Notice that in the zero 
mode equation (2.26) $ is a free index. This means that if S^A^ is a zero mode then so is S^A^G^a, 
for any constant matrix G. In particular, if b^A^ is a zero mode then so is (I ■ S^A)^ = iS^A^r^- 

— * 

Since the zero modes form a complete set, there must exist I v ^ such that 

(J • S^A) a a = 5 u A a aI v n , (2.35) 
from which it follows that the triplet I u ^ satisfies the algebra (2.33). 

At this stage, T '„ are almost complex structures because we have not proved that they are integrable 
(B.l). Rather than prove this directly we will follow the analysis of Maciocia [16] and construct a hyper- 
Kahler potential for 97lfe. This proves that DJlk is not only hyper-Kahler but also it is a rather special kind 
for which each complex structure shares the same Kahler potential (called the hyper-Kahler potential). 
The expression for the potential is [16] 

X = ~J d 4 xx 2 ti N F 2 mn . (2.36) 

In order to prove this, we pick out one of the complex structures of M 4 . Choose holomorphic 
coordinates {z l ,z % ), % — 1,2, with respect to this particular complex structure. For example, choosing 
we have z l = ix 3 + x 4 and z 2 = ix 1 — x 2 . The complex structure is associated, via (2.35), with 
a complex structure on We can then choose a set of matching holomorphic coordinates on OTt^. 

(Z\ Z 1 ), i = 1, . . . , |dim9Jlfc for which the complex structure on the moduli space is 

' w = Co' -«,) ■ ^ 

Then, 



OZ^aa dZ l ' V dZ l Jaa 8Z l 

For example, for we have 

^ = , ^ = . (2.39) 

dZ l dZ l K ' 

Furthermore, the variation equation (2.20) implies that the derivatives above automatically satisfy the 

background gauge condition (2.25). Consequently 

- H , Un - ^ (2.40) 
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are zero modes directly without the need for a compensating gauge transformation (2.30). This fact 
turns out to be crucial. Furthermore, 

ff A 

7^?- = , (2.41) 
dZ^dZ 1 ' v ; 

as is clear from (2.39) for the case of I^. 

By explicit calculation, using the zero mode condition (2.26) and (2.41), one finds 

d 2 



. } 7}})Z i^n Fi n = n trjy SiAJj A n - 2d m d n tr N 5 { A m ^A n . (2.42) 

Using this in (2.36) and integrating-by-parts twice, discarding the surface terms since the zero modes 
decay as 0(x~ 3 ), we have 

~y = -2g 2 J d A x tr N 5,An $4, • (2.43) 

By comparing with (2.32), we see that the above expression is a component of the metric on the space 
of zero modes: 

M = mb d * d2 >- (2 - 44 > 

This proves that x is the Kahler potential for the complex structure 1^ for c = 3. However, x manifestly 
does not depend on the choice of the index c = 1,2,3 and so it is a hyper-Kahler potential, and by 
implication 9Jl k is hyper-Kahler space. 



II. 4 The ADHM construction of instantons 



In this section, we describe the construction of instantons due to Atiyah, Drinfeld, Hitchin and Manin 
(ADHM) [7] and how this leads to a description of DJlk for which the hyper-Kahler property is manifest. 
This remarkable construction of ADHM was originally discussed in Refs. [17-19]. Here we follow, with 
minor modifications, the SU(A r ) formalism of Refs. [20,21]. Our approach here is to describe the ADHM 
construction as an ansatz for producing instanton solutions and we direct the reader to the references 
above for the more mathematical technicalities. 

The basic object in the ADHM construction is the (N + 2k) x 2k complex- valued matrix A\ia which 
is taken to be linear in the spacetime variable x n \ 

A Xl& (x) = a Xl& + b a Xl x aA , A? \x) = a? A + x« a b* a . (2.45) 

Here, we have introduced 'ADHM indices" A, /i . . . = 1, . . . , N + 2k and "instanton indices" i, j, . . . = 
1, . . . , k and used the quaternionic representation of x n as in (2.8) and (2.9). By definition the conjugate 
is 7 

Af A = (A Aid )* . (2.46) 

7 Throughout this, and other sections, an over-bar means Hermitian conjugation: A = At. Notice as usual we have to 
distinguish between upper and lower spinor indices, and hence ADHM indices, but not for the other types of index. 
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We will soon verify by direct calculation that k is the instanton charge of the solution. As discussed 
below, the complex- valued constant matrices a and b in (2.45) constitute a (highly over complete) set 
of collective coordinates on dJtk- 

Generically, the null-space of the Hermitian conjugate matrix A(x) is N- dimensional, as it has iV 
fewer rows than columns. The basis vectors for this null-space can be assembled into an (N + 2k) x N 
dimensional complex- valued matrix U\ u (x), u — 1, . . . , N: 

Af x U Xu = = U x A Xt& , (2.47) 

where U is orthonormalized according to 

U x U Xv = 5 UV . (2.48) 

The construction requires a non-degeneracy condition: the maps A A (x) : C k — > C N+2k must be injective 
while the maps A a (x) : C N+2k — > C k must be surjective. Having said this, we will see in §11.6, that 
points where the non- degeneracy conditions break down have an interesting physical interpretation. 



In turn, the classical ADHM gauge field A n is constructed from U as follows. Note first that for the 
special case k — 0, the antisymmetric gauge configuration A n defined by 8 

(A n ) uv = g-WfaUxv (2.49) 

is "pure gauge" so that it automatically solves the self-duality equation (2.5) in the vacuum sector. The 
ADHM ansatz is that Eq. (2.49) continues to give a solution to Eq. (2.5), even for nonzero k. As we 
shall see, this requires the additional condition 

A? A A Aj/3 = 5%(f-% , (2.50) 

where / is an arbitrary x-dependent k x k dimensional Hermitian matrix. Note that the existence of 
the inverse f~ l is guaranteed by the non-degeneracy condition. 



To check the validity of the ADHM ansatz, we first observe that Eq. (2.50) combined with the 
null-space condition (2.47) imply the completeness relation 

Vx 11 = U Xu UZ = ^ - A.v„/- V A7 . (2.51) 

Note that V, as defined, is actually a projection operator; the fact that one can write V in these 
two equivalent ways turns out to be a useful trick in ADHM algebra, used pervasively throughout the 
instanton calculus. With the above relations the expression for the field strength F mn may be massaged 
as follows: 

F m n = d m A n - d n A m + g[A m , A n ] = g-^UUd^U) + g-\Ud [m U)(Ud n] U) 
= g~ l d [m U(l - UU)d n] U = g' l d [m UAfAd n] U 
= g- 1 Ud [n Afd n] AU = g- 1 ma [m a n] fbU 

= Ag^UbamnfbU . 

8 Frequently, where confusion cannot arise, we do not indicate all the indices. Often only the spinor indices have to 
labelled explicitly. 
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Self-duality of the field strength then follows automatically from the self-duality property of the tensor 
a mn (2.12). 

The instanton number of the ADHM configuration can be calculated directly using a remarkable 
identity of Osborn [8] for trjv F 2 in the ADHM background proved in Appendix C: 



9 2 



16tt 2 



J d 4 xtr N F 2 mn = -±- 2 J d A x D 2 tr k log / . (2.53) 



Now from (2.50) one can deduce the asymptotic form for f(x) for large x, f(x) ^ x 2 l [k]x[k] , and 
therefore the right-hand side is equal to k as claimed. 

Let us analyse the factorization condition (2.50) in more detail. Noting that fij(x) is arbitrary, one 
extracts three x-independent conditions on a and b: 

of X a Xj $ = (¥ a )ij S % > ( 2 - 54a ) 
a?% = \^a% , (2.54b) 
= (2.54c) 

These three conditions are generally known as the "ADHM constraints" [18, 19]. 9 They define a set of 
coupled quadratic conditions on the matrix elements of a, a b and b. Note that (2.54b) and (2.54c) can 
be combined in the useful form 

A*b a = b a A« . (2.55) 

The fact that the ADHM construction involves non-linear constraints presents considerable difficulties 
for practical applications. However, it turns out, as we shall we shall see in §VI.l, that the ADHM 
constraints can be resolved in a simple way, at least generically, when iV > 2k. 

The elements of the matrices a and b comprise the collective coordinates for the fc-instanton gauge 
configuration. Clearly the number of independent such elements grows as k 2 , even after accounting for 
the constraints (2.54a)-(2.54c). In contrast, the number of physical collective coordinates should equal 
4kN which scales linearly with k. It follows that a and b constitute a highly redundant set. Much of this 
redundancy can be eliminated by noting that the ADHM construction is unaffected by ^-independent 
transformations of the form 

A -> AAT" 1 , U — > AU , / -> T/T f , (2.56) 



provided A e U(A^ + 2k) and T e Gl(k, C). Exploiting these symmetries, one can choose a repre- 
sentation in which b assumes a simple canonical form [18]. Decomposing the index A = u + ia 

9 We should warn the reader that we use the term "ADHM constraints" in a rather more restricted sense after a 
canonical choice has been made below. 
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The remaining variables all reside in a and we will split them up in a way that mirrors the canonical 
form for b: 

ax, a = a (u+ia)j& = , af = af u+ia) = (wf u (fi'*%) , (2.58) 

With b in the canonical form (2.57), the third ADHM constraint of (2.54c) is satisfied automatically, 
while the remaining constraints (2.54a) and (2.54b) boil down to the k x k matrix equations: 

T^^aa) = (2.59a) 
(a'J = a' n . (2.59b) 

In (2.59a) there are three separate equations since we have contracted aPa^ with any of the three Pauli 
matrices, while in Eq. (2.59b) we have decomposed a' a6l and a' aa in our usual quaternionic basis of spin 
matrices (2.8): 

= a' &a = <af • (2.60) 

In the canonical form (2.57) we also have the useful identity 

by = Sjl [k]x[k] (2.61) 

and the ADHM matrix / takes the form 

/ = 2(w"wa + (a' n + x n l [k]x[k] ) 2 y l . (2.62) 

Note that the canonical form for b (2.57) is preserved by a residual U(A;) subgroup of the U(iV + 
2k) x Gl(fc,C) symmetry group (2.56): 

A= f 1 ™ ° ), T = S, EeV(k). (2.63) 

These residual transformations act non-trivially on the remaining variables: 

w U ia -> , <4 -> S f a' n S . (2.64) 

Henceforth, we shall use exclusively the streamlined version of the ADHM construction obtained 
by fixing b as in (2.57). The basic variables will be = {iVa,a' n } where we will automatically assume 
(2.59b) so that the four k x k matrices a' n are defined from the outset to be Hermitian. The remaining 
constraints (2.59a) 

t*. = (w^w„ + a'^Xa) = (2.65) 
will be called the "ADHM constraints" . 



II. 4.1 The ADHM construction as a hyper-Kahler quotient 



It follows from the ADHM construction of the solutions of the self-dual Yang-Mills equations that the 
moduli space DJlk is identified with the variables subject to the ADHM constraints (2.65) quotiented 
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by the residual symmetry group U(fc) (2.64). Subsequently it was realized that the ADHM construction 
was actually an example of a more general construction that has become known as the hyper-Kahler 
quotient [22]. This construction is reviewed in Appendix B and provides a way of constructing a new 
hyper-Kahler space 97t starting from a "mother" hyper-Kahler space 97t with suitable isometries. In 
the case of the ADHM construction, the mother space 971 is simply the Euclidean space ^ 4k ( k + N ) with 



coordinates and metric 



g = A7r 2 (2dwf u dw ui a + d(a /<k %' d{a' a& ) ji ) = 87r 2 tr fc (dw* dwa + da' n da' n ) . 



(2.66) 



The normalization factor here will be justified in §11.5. Flat space with dimension a multiple of four is 
trivially hyper-Kahler as described in Appendix B. The tangent space of a hyper-Kahler space of real 
dimension 4n, admits a distinguished Sp(ra) x SU(2) basis of tangent vectors. For flat space R 4fc ( 7V + fc ) ) 



this symplectic structure is realized by a set of coordinates z ia , i 
terms of which the flat metric is 

// <>:: (n //: i %/- i( . 

where Cl is a symplectic matrix. In the present case, where 971 = 
coordinates and matrix Q are 



= 1, . . . , 2k(N + k) and a = 1, 2, in 

(2.67) 

M 4fc(7v+fc) ig identified with a & , the 
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(2.68) 
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where the Roman indices i, j, . . . each run over the set of 2k(N + k) composite indices {iu, ij, ui, ij}. 



The ADHM construction is the hyper-Kahler quotient of 971 = K 4fc ( 7V + fc ) by the isometry group U(k) 
which acts on the variables as in (2.64). This action defines a set of tri-holomorphic Killing vector 
fields on 971: 



iT r w h 



d 



+ i[T r ,a' r 



d 



where T r is a generator of U(fc) in the fundamental representation. 10 



(2.69) 



There are two main parts to the quotient construction. Firstly, one restricts to the level set 9T C 971, 
defined by the vanishing of the moment maps associated to the isometries, in this case the U(k) vector 
fields (2.69). The expressions for the moment maps are given in (B.39), using (2.68) and (2.69), the 
moment maps are 

= 47r 2 f^tr fe (T r a«ap) - ( r . (2.70) 

— # 

The central elements ( r can, in this case, take values in the Lie algebra of the U(l) factor of the gauge 
group. One notices immediately that the ADHM constraints (2.65) are precisely the conditions \x Xr = 
(with vanishing central element). In other words, the ADHM constraints explicitly implements the first 
part of the quotient construction. The second part of the quotient construction involves an ordinary 
quotient of 91 by the U(k) action (which is guaranteed to fix 9T). But this quotient, as we have seen, is 
also an essential ingredient of the ADHM construction. 
10 We will choose a normalization tv k T r T s = S rs . 
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The conclusion is that the ADHM construction realizes the instanton moduli space Wl k as a hyper- 

— # 

Kahler quotient of flat space by a U(k) group of isometries (with vanishing central elements ( r = 0). 
Later in §XI.l, we shall describe what the physical interpretation of taking a non- vanishing ( r . The 
dimension of the quotient space DJtk is 

dim m k = dim Ml- 4 dim U(Jfe) = 4kN , (2.71) 

as anticipated by the Index Theorem. 

The importance of the hyper-Kahler quotient construction is that geometric properties of DJlk are 
inherited from the mother space 971 in a rather straightforward way (as described in Appendix B). For 
example of particular importance will be the metric on DJlk and we will now describe in some detail how 
to construct it. First of all, we focus on T9T, the tangent space of the level set. Locally as described in 

~ (c) (c) 

Appendix B, this is the subspace of T9Tt orthogonal to the 3k 2 vectors i" X r , where I are the three 
independent complex structures of the mother space Then we have the following decomposition 

T9t = V©IK, (2.72) 

where the horizontal subspace "K is the subspace orthogonal to the vectors X r and the vertical subspace 
V is the orthogonal complement. The tangent space of 9Jtfc is identified with the quotient TVt/V. This 
means that each X e T9JT has a unique lift to IK, which, by a slight abuse of notation, we denote by 
the same letter. The metric on the quotient g(X, Y) is then identified with the metric on 9JT, g(X, Y), 
evaluated on the lifts to K. 

What we would like to show is that the metric on DJlk inherited from the quotient construction is 
equal to the metric on Ml k that arises from the functional inner product of zero modes (2.32). Rather 
than implement the procedure that we describe above explicitly, it is more straightforward to use the 
fact that the hyper-Kahler spaces that we are considering are of a special class which admit a hyper- 
Kahler potential. We have already determined in (2.36) the form of this potential for the metric arising 
from the inner product of zero modes. Is this equal to the hyper-Kahler potential arising from the 
quotient construction? First of all, the mother space 9JI trivially admits such a potential with 

X = a.,, ^ = 87r 2 tr fc (w*w & + a! n a' n ) . (2.73) 

The hyper-Kahler potential on the quotient space DJlk is then simply obtained by finding a parameter- 
ization of the ADHM variables in terms of the coordinates {A' 1 } on 971^. In other words z ia (X), or 
a«(A), which solves the ADHM constraints and for some choice of gauge slice for the U(fc) action on 
the level set 9T. The hyper-Kahler potential for 971^ is then obtained by restriction 

X (X) = x{z ia {X)) = 8n 2 tr k (w«(X)wa(X) + a' n (X)a' n (X)) . (2.74) 

We now evaluate (2.36) using Osborn's formula (C.6) for trjy F^ n , giving 

X = d 4 xx 2 n 2 tr k \ogf . (2.75) 

Applying Gauss's Theorem along with the ADHM form for the matrix / (2.62) and (2.45), one finds 
precisely (2.74). 
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II. 4. 2 Symmetries and the moduli space 



We now show how the symmetries of the classical gauge theory broken by an instanton solution are 
realized on the ADHM moduli space. Consider, first of all, the action of the conformal group on the 
instanton solutions. The action of the conformal group is given most elegantly in the quaternion basis 
(2.19). So acting on the ADHM variable A(x) we find 

A(x'; a, b) = A(x; aD + bB, aC + bA)(Cx + D)' 1 . (2.76) 

Notice that since the gauge field only depends on U and U, defined by (2.47), the factor of (Cx + -D) -1 
on the right is redundant. Hence, the action of the conformal group on the ADHM variables is 

a^aD + bB, b^aC + bA. (2.77) 

To get the transformation on our canonical basis (2.57) and (2.58) we have to perform a compensating 
transformation of the form (2.56) in order to return b to its canonical form (2.57). So there exist 
transformations of the form (2.56), with A and T dependent on a and the element of the conformal 
group, that takes 

A{aC + bA)T^ = b , (2.78) 

where b assumes its canonical form (2.57). The resulting action of the conformal group on the ADHM 
variable a is then 

a — > A(aD + 65)T _1 . (2.79) 
For example, translations act on the ADHM coordinates in the following way. From 

A(x + e;a,b) = A(x;a + be,b) , (2.80) 

we deduce 

a 'n -»■ a 'n + enl[fc]x[fc] , ™d -> Wa ■ (2-81) 

This allows us to identify the coordinates of the centre of the instanton with the component of a' n 
proportional the identity matrix: 

X n = -k- 1 tT k a' n . (2.82) 

Note that these centre-of-mass coordinates do not appear in the ADHM constraints (2.65) reflecting of 
the fact that the moduli space is the product (2.16). 

Global gauge transformation act on the gauge indices u, v, . . ., so only on the quantities UW- Gener- 
ically, w ui a constitute a set of 2k complex A^-vectors. Consequently if N < 2k then all global gauge 
transformation generically act non-trivially on the ADHM variables while if N > 2k then there is a 
non-trivial subgroup that leaves the instanton fixed. This is the stability group of the instanton. In order 
to identify it, we follow Bernard's description of the one instanton moduli space [23] and embed the 
k instanton solution in an SU(2/c) dimensional subgroup of the gauge group. This involves choosing a 
suitable gauge transformation that puts the N x2k matrix w, with elements uw, into upper-triangular 
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form: 



An 62 
62 



/ w u ■ 
\wm ■ 



W Ny2 k ) 



IX- 



£l,2fc \ 
£>2,2k 



^2fc,2fc 





(2.83) 



\ ••• / 

The (,ab, a, 6 = 1, ... , 2 A;, are complex except for the diagonal elements £ aa which we can choose to be 
real. Note the group elements IX in the SU(iV — 2k) subgroup in the lower (N — 2k) x (TV — 2k) corner 
leave £ invariant. Therefore, at least generically IX is valued in the coset 

U SV(N-2k) ■ (2 ' 84) 

There are AkN independent real parameters in w on the left-hand side of (2.83), matching Ak(N — k) 
in IX and Ak 2 in £, on the right-hand side. 

With the parameterization (2.83), the instanton solution has the form 

A _ T ft ( (^n St )[2fc]x[2fc] 0[2k]x[N~2k 

n ~ \ n n 

\ u [iV-2fe]x[2fe] u [AT-2fe]x[JV-2fc] / 

Where A™ st is the /c-instanton solution lying, generically, in SU(2A;) C S\J(N). Generically, the stability 
group of the instanton solution consists of the SU(iV — 2k) in the denominator of (2.84) along an 
additional U(l) transformation generated by 



IX 



(2.85) 



A 



L [2k]x[2fc] 



0, 



0[2fe]x[AT-2fc] 
2k 1 
: i[ 



(2.86) 



>[N-2k\x[2k] N—2k - L [A f -2fe]x[iV-2fc] / 

The stability group, therefore, consists of elements e tXe g, with g e SU(A^ — 2k). Notice that this is not 
exactly SU(iV — 2k) x U(l) because elements elements e tXei and e lX62 with 9 1 — 9 2 = it differ by an element 
of the centre of SU(iV - 2k). More precisely, therefore, the stability group is S(U(iV - 2k) x U(l)) C 
SU(iV). To summarize: the non-trivial global gauge transformations acting on a generic instanton 
solution are 

SU(iV) 



SU(iV) for N < 2k , 



for N >2k . 



(2.87) 



S(U(iV -2k) x U(l)) 

For iV > 2k, we will loosely refer to IX, taking values in (2.84), as the "gauge orientation" of the 
instanton, even though the true gauge orientation involves quotienting by the additional U(l) described 
above. Notice, the additional U(l) can be identified with the U(l) subgroup of the U(k) auxiliary group. 



II. 4. 3 Singular gauge, one instanton, the dilute limit and asymptotics 

Let us determine the gauge field A n more explicitly. This entails solving for U, and hence A n itself via 
(2.49), in terms of A. It is convenient to make the decomposition: 

U\ v = U( uJr i a ) v = ^j^j,y ^ , A\ja — ^(u+ia)ja = ^^'^..^ ' (2.88) 
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Then from the completeness condition (2.51) one finds 

V = l [If]x[N] -w & fw & ■ (2.89) 

For any V that solves this equation, one can find another by right-multiplying it by an x-dependent 
U(iV) matrix. A specific choice of V corresponds to fixing the spacetime gauge. The "singular gauges" 
correspond to taking any one of the 2^ choices of matrix square roots: 

V=(l lN]xlN] -w & fw & ) 1 / 2 . (2.90) 

Next, U' in (2.88) is determined in terms of V via 

U' = -A'Jw^V- 1 (2.91) 

which likewise follows from (2.51). Equations (2.90) and (2.91) determine U in (2.88), and hence the 
gauge field A n via (2.49). 



We now show how the well-known (2.15) of 't Hooft for the one instanton solution in SU(2) (in 
singular gauge) is reproduced by the ADHM construction. Adopting the canonical form (2.57), we set 
the instanton number k — 1, thus dropping the instanton, i,j, indices. Eq. (2.59b) then says that a' n is 
a real 4-vector which from the last section we will identify as minus the centre — X n of the instanton, 
as per (2.82), 

a' n = -X n e M 4 . (2.92) 

The ADHM constraint (2.65) collapses to 

w>up = P 2 8%- (2-93) 

The parameter p will soon be identified with the instanton scale size. The constraint (2.93) can be 
explicitly solved by taking 

w = P U ) , (2.94) 

\U[JV-2]x[2]/ 

which is the decomposition (2.83) for one instanton and identifies U G SU(iV) as the gauge orientation 
of the instanton. For one instanton, the ADHM quantity / is a scalar; from (2.62) 

/= (,-aW - < 2 ' 95 > 




x - X) 2 + p 2 
(x - X) a& w a 



— 1 )WnW 



(2.96) 



-xW(x-xy + P 2 ' 



\x 



Using (2.49), one finds the expression for the gauge potential in singular gauge 

2wa{x - X) m a mn a pw p 
{x~Xf({x-Xf + p 2 ) 



, 2wa(x — X), m a mn a hW^ 

A n = g- \ %wr, r/ ,x • (2-97) 
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Comparing with the SU(2) solution (2.15), this form of the solution manifests the fact that a single 
SU(iV) instanton is described by taking the SU(2) instanton solution and embedding it in SU(2) C 
SU(iV). In this case the three SU(2) generators are 

T c uv = P ~ 2 w u6t T c ^wt , c = 1, 2, 3. (2.98) 

The fact that these generators satisfy the SU(2) algebra is guaranteed by the ADHM constraints (2.93). 
With the explicit solution (2.94) 

A n = u( n Q |M Itt, (2.99) 

which manifests the fact that It is the gauge orientation of the instanton taking values in the coset 
SU(iV)/S(U(JV - 2) x U(l)). 

In general a multi-instanton configuration cannot be thought of as a combination of single instantons. 
However, there are asymptotic regions of 971^ where the solutions can be identified as being composed 
of well-separated single instantons. Here, we consider the completely clustered limit in which the k- 
instanton configuration looks like k well-separated single instantons. Up to the action of the auxiliary 
U(/c) symmetry, the completely clustered limit is the region of moduli space where the differences 
between the diagonal elements of a' n are much greater than the off-diagonal elements (in a sense that 
we make precise below). We can then identify X % n = — (a' n )u as the centres of each of the k single 
instantons. 

To be more specific, it is useful to fix the U(fc) symmetry by setting to zero the off-diagonal com- 
ponents of a' n that are generated by U(k) adjoint action on the diagonal matrix diag(— X*, • • • , — X%). 
The "gauge choice" amounts to taking 

(Oypr* -*'')» = (2.100) 

and we will denote the a' so constrained by a'. This leaves the diagonal symmetry U(l) fc which will be 
identified with the auxiliary symmetry of each of the k single instantons. In the complete clustering 
limit, the terms (a' aa )ik(a>' a p)kj, k ^ can be ignored in the ADHM constraints (2.65). In this case, 
these off-diagonal constraints are linear in (a' a& )ij, for i ^ j, 

(X* - XT a (~a'J tJ + w«w ujp oc 5^ (2.101) 

and can be solved, although we will not require explicit expressions for the solutions. The diagonal 
components of the ADHM constraints (2.65) in this limit are then simply 

= , (2-102) 

(no sum on i) for arbitrary p^. The constraint (2.102) is then the ADHM constraint of a single instanton 
and we can therefore identify pi with the scale size of the i th instanton. Each instanton is associated 
with a particular SU(2) embedding of SU(iV) defined by the generators 

(T t c ) uv = p; 2 w ul „T c «pwl , c = 1, 2, 3, (2.103) 

with no sum on i. One can show that the completely clustered limit is valid when, for each i ^ j, 

{X 1 - Xi) 2 » p iPj tr N (T?T?) . (2.104) 
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In other words, the separation between the instantons must be much greater than the product of the 
scale sizes times a trace over generators which measures the overlap of the SU(2) embeddings of each 
of the instantons. 

It will be important for many of the applications of the instanton calculus to describe the asymptotic 
fall off of the fields from the centre of the instanton. The nature of the fall off depends on the gauge 
used which for us means singular gauge as described above. From the previous formulae of this section 
for arbitrary instanton charge k, we find the leading large-x asymptotic behavior of several key ADHM 
quantities, in singular gauge (2.90): 

Aq, > b x a a , fij > — g Sij , U > w , V > 1[jv]x[at] (2.105) 

and in addition, for the gauge field 

A n -> j" 1 ^^/ • (2.106) 



II. 5 Zero modes and the metric on %Jlk 

The ADHM construction yields an explicit solution for the self-dual gauge field in terms of the (over- 
complete) ADHM collective coordinates a„. We now show how one can find explicit expressions for the 
zero modes. In §11.2, we showed, in the quaternion basis, that zero modes 5A a a are solutions of the 
covariant Weyl equation f) aa 5A a = 0. In Appendix C (Eq. (C.ll)) we verify that the following linear 
functions 

K{C) = UCfb a U -UbJCU , (2.107) 

for a constant (N + 2k) x k matrix C Xi , is a solution of the covariant Weyl equation in the background 
of the instanton, 

V« a A a (C) = , (2.108) 

as long as C satisfies the constraints 

Ci\d\ja = —a,ia\C\j (2.109a) 
C tX b a Xj =bt x C Xj . (2.109b) 

There are 2k(N + 2k) real degrees-of-freedom in C subject to Ak 2 constraints (2.109a)-(2.109b). Hence, 
there are 2kN independent solutions to the Weyl equation as anticipated by the Index Theorem. 

The question is how these solutions are related to the derivatives of the gauge field with respect to 
the collective coordinates? If X M is an arbitrary collective coordinate then we prove in Appendix C 
(Eq. (C.16)) that 

|^ = -g-iv (®Lu) + g -iu^-fa n bU - g-Wba n f^-U . (2.110) 
In the quaternion basis we recognize the 2nd and 3rd terms as being 2g~ 1 A a (C dl ) with = da^/dX 11 : 

= -g-^ct^U) + 2g- 1 A a (da*/dXn . (2.111) 
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The first term in (2.111), as its form suggests, is precisely the compensating gauge transformation (2.30) 
needed in order to force the zero mode into background gauge: 

dU 

% = -^g^U . (2.112) 

Hence the expression for the zero modes is given explicitly by the linear function defined in (2.107) 

5,A aa = 2g- 1 A a (daJdX») = 2g- 1 t/(^ I fb a - b a f^)u . (2.113) 

The fact that = dcia/dX^ satisfies the constraints (2.109a) and (2.109b) is partially taken 
care of by taking the X M -derivative of the ADHM constraints (2.54a)-(2.54b). From the hyper-Kahler 
quotient perspective, described in §11.4.1 and Appendix B, the conditions that arise from taking the X^ 
derivative of (2.54a) are equivalent to the requirement that da^/dX 11 is orthogonal to the 3/c 2 vectors 

~ (c) ~ 

i" X r : i.e. dcia/dX 11 lies in the tangent space T91 C T9JT. However, this leaves k 2 additional constraints 
on dda/dX^ which have not been accounted for; namely, 

^a a -a«p^ = 0. (2.114) 
dX^ a dX" v ; 

From the hyper-Kahler quotient point-of-view, these extra constraints simply require that the vector 
dcia/dX^ is orthogonal to the tri-holomorphic Killings vectors X r , (2.69), which generate the U(fc) 
isometries of This means that it lies in the horizontal subspace "K C The horizontal subspace 
is identified with the tangent space T9Jtfc by a unique lifting procedure. Hence, the zero modes are 
naturally associated to tangent vectors to QJt fc . 

It is remarkable that one can calculate the explicit functional inner product of the zero modes 
in singular gauge by using a formula attributed in Ref. [8] to Corrigan [24]. The proof, reviewed in 
Appendix C (Eq. (C.20)), was first published in [25], for gauge group Sp(l), and extended to gauge 
group SU(iV) in [21]. The identity can then be written in terms of the Weyl spinor quantities A(C) as 11 

J2 



Here 



j d i xtr N A(C)A(C') = -^-tr k [C(V 00 + l)C -C'iV^ + ^C] . (2.115) 

V OD = \imV = l-bb= JWp*A (2.H6) 

x^oo \0[2fc] X [jv] 0[ 2fe ] X [2fe]y 

From this identity, and the expression (2.113), we can deduce the expression for the metric on the space 
of collective coordinates (2.32) 

M*) = 2* 2 M||>co + Ufj| + ^(-P™ + l)f£) • (2H7) 

Since, da^/dX^ are the components of a vector in "K (due to the constraints (2.109a)-(2.109b)), the 
metric arising from the functional inner product of zero modes (in singular gauge) (2.117) is identical 

11 The relative minus sign in the second term compared to Eq. (2.61) in [21] is due to the fact that in that reference 
we have written the identity for the fermion zero modes and so M and Af are Grassmann collective coordinates, whereas 
here C and C are c-number-valued. 
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to the metric on induced by the hyper-Kahler quotient construction. This is completely in accord 
with the argument in §11.4.1 involving the hyper-Kahler potential. 

Now that we have found the explicit form of the zero modes and compensating gauge transformations 
in singular gauge, one can easily verify that for large x 

8 A 

5,A m ~ 0(x- 3 ) , ^ ~ 0{x' 3 ) , ~ 0(x~ 2 ) . (2.118) 

These formulae follow from the explicit expressions (2.110) and (2.112) and asymptotic formulae in 
(2.105). 



II. 6 Singularities and small instantons 

We have previously remarked that the instanton moduli space DJik fails to be a smooth manifold due 
to certain singularities. In this section we will explore these interesting features in more detail. Before 
we embark on this analysis, it is important to emphasize that these singularities are not evidence of 
any pathology in the instanton calculus. The integrals over the DJlk appearing in the semi-classical 
approximation of the functional integral are perfectly well defined in the vicinity of these singularities 
and in this sense the singularities are integrable. 

Before we talk about the singularities specifically, let us develop a picture of the geometry of 971^. 
In fact it will be more convenient, for the most part, to consider the centered moduli space DJlk defined 
in (2.16). As we have shown, the ADHM construction of 9Jtfc, or 9Jlk, is an example of a hyper-Kahler 
quotient based on the quotient group U(k) and starting from flat space. Since the quotient group has 
an abelian factor the quotient construction can, in general, involve the parameters ( c of (2.70) taking 
values in the u(l) subalgebra of the Lie algebra of U(k). However, the ADHM construction requires 
( c = and this has some interesting consequences that we now elaborate. 12 Firstly, the quotient space 
admits a dilatation generated by a homothetic Killing vector which we denote k. 13 To see this, let us 
note that the mother space DJl = M. 4n trivially admits a dilatation generated by the homothetic Killing 
vector k with components 

k™ = z™. (2.119) 

In addition k is hypersurface orthogonal, i.e. 

~ia = Qia- ? (2.120) 

for some function x, which we identify with the hyper-Kahler potential (2.73). When the central terms 
( c = it is easy to see that R, preserves the level set 91 and hence is the lift to IK of a (hypersurface 
orthogonal) homothetic Killing vector k on the quotient space. The dilatation arises as a consequence of 
the classical conformal invariance of the gauge theory described in II. 4. 2. The existence of the dilatation 
implies that the metric on the quotient DJik has the form of a cone [26]. A space of this kind is known 

12 The case C c 7^ corresponds to the ADHM construction in the gauge theory defined on a non-commutative spacetime 
as described in §XI.l. 

13 A homothetic Killing vector is a conformal Killing vector, so C K g — <j>g, for which is a constant. 
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as a hyper-Kahler cone. At this point it is convenient to talk about the centered moduli space DJlk- 
Everything we have said above is equally applicable to 971^ which has hyper-Kahler potential x equal 
to x i n (2.74) with the trace components of a' n set to zero. The fact that 971^ is a cone means that 
DJl k ~ R + x £ with a metric 

ds 2 = ^ + X dsl. (2.121) 
Since # is not a sphere, DJlk has a conical singularity at x — of which we will have more to say below. 

The quotient space also inherits the obvious SU(2) isometries (acting in the obvious way on the a 
indices) from the mother. These isometries on 9JT are generated by three Killing vectors with components 

n (c)ia = J^a. z iP = iT c«. z iP . (2.122) 

As with the dilatation above, it is precisely when the central terms ( c vanish that these symmetries are 
inherited by the quotient space DJlk- To see this one simply has to verify that the action preserves the 
level set \i c = 0. 14 Notice that the four vectors k and are mutually orthogonal. 

The existence of the dilatation and SU(2) isometries together implies that the centered moduli space 
is locally a product OJt^. = pR 4 /Z 2 ] x for some 4kN — 8 dimensional space H. 15 In fact, when DJlk 
is described as the cone (2.121) the space $ is a tri-Sasakian manifold, which due to the structure of 
SU(2) isometries described above, has the form of a non-trivial fibration of SO(3) over 0. One can 
further show that is a quaternionic Kdhler space (see [27,28] for more details and other references). 

Having established the interesting cone geometry of 9JTfc we now turn more specifically to its singu- 
larities. It is not hard to pin down the reason for the singularities. The ADHM construction requires 
us to quotient the level set 9T by the U(fc) auxiliary symmetry group and this procedure will introduce 
orbifold singularities at points in OT where U(k) does not act freely, i.e. where some subgroup of U(fc) 
leaves a^, a solution to (2.65), fixed. Geometrically, the volume of the gauge orbit through any point 
in 9T is proportional to the determinant of the matrix of inner products of the U(k) tri-holomorphic 
Killing vectors {X r }: 

1/2 

det k 2~g(X r ,X s ) . (2.123) 

The k 2 x k 2 matrix with elements g(X r ,X s ) plays a ubiquitous role in the instanton calculus and we 
introduce the notation 

g(X r ,X s ) = L rs = Mhv k {T r LT s ) , (2.124) 

where the generators T r of U(fc) were defined previously in §11.4.1. Here, using (2.66) and (2.69), L is 
an operator on k x k Hermitian matrices of the form 



i{«> V*, n} + is'^n - a' aa na' aa + ins*Xd > (2125) 



L ■ n = \{w a w^ Q} + \a! aa d a6 p. - a aa Qa' a(X + iQa'™ a 

= ±{w & w & ,n} + [a' n , KM 



At a point where U(fc) does not act freely, the operator L develops one, or more, null eigenvectors. 
The relevant effect can be seen already at the one instanton level. In this case, the ADHM constraints 

14 It is worth pointing out that the action is not tri-holomorphic because the SU(2) rotates the complex structures. 
15 The 1*2 quotient (acting by inversion) arises because the centre of SU(2) lies in the U(fc) quotient group. 
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are more explicitly (2.93). The auxiliary group U(l) acts by phase rotation, % — > e l(t> w a , but does 
not act freely when w a = which, from §11.4.3, is the point at which the instanton has zero scale size 
p. This is precisely in accord with our description of the centered moduli space 9Jtfc above as a cone 
over the tri-Sasakian space For the single instanton the apex of the cone is the point p = 0. At 
the one-instanton level we can be more explicit. In (2.87) we have identified the action of global gauge 
transformations on DJlk- In fact at the one-instanton level all gauge orbits are equivalent to the generic 
one (2.87) and therefore the whole centered moduli space DJti is simply a cone over the gauge orbit: 

m - K + x . - SU(iV) - - , (2.126) 
S(U(7V-2) x U(l)) 

where the variable along the cone is x °t P 2 - Since, at the one-instanton level, there are no additional 
singularities in 9Jt fc , the corresponding $ is an example of a homogeneous tri-Sasakian space [27]. 

As discussed above, the centered moduli spaces DJlk for instanton charge k > 1 also have the structure 
of a cone over a tri-Sasakian space. The apex of the cone in this case corresponds to the point where 
w a = and all the non-trace parts of a' n vanish. We will shortly interpret this as the point in the moduli 
space where all the instantons have shrunk to zero size and all lie coincident in IR 4 . This is the point of 
maximal degeneracy where the whole of U(/c) is fixed. However, there are other singularities of smaller 
co-dimension in the moduli space reflecting the fact that the tri-Sasakian space # in this case is not 
homogeneous and has singularities of its own. These can be uncovered in the following way. In §11.4.3, 
we have described how multi-instanton configurations in various asymptotic regions of the moduli space 
can be identified with clusters of smaller numbers of instantons. In particular, when w U i a — > for some 
fixed % — I, the ADHM constraints imply that the elements (a' n )u, (a' n )n — > 0, for % ^ I. In the limit, the 
subgroup U(l) C XJ(k) corresponding to 

w uia *■ w uia& i i a n)ij > ^ ^ ^^i^nJij (2.127) 

does not act freely. This is a limit in the moduli space, where the /c-instanton configuration looks like 
a smooth (k — l)-instanton configuration along with a single instanton that has shrunk to zero size. 
The point-like instanton still has a position X l n = —(a' n )u', hence, in the limit 974 — > %Rk-i x This 
process can continue. There are regions where some subgroup U(l) r C U(/c) does not act freely, which 
corresponds to a (k — r)-instanton configuration along with r point-like instantons. In this limit the 
moduli space is of the form 

Wl k — ► aJt fc _ r x Sym r M 4 . (2.128) 

Here, Sym r M 4 is the symmetric product of r points in M 4 . 16 Additional singularities arise when the 
point-like instantons come together at the same spacetime point. This describes a situation where a 
non-abelian subgroup of U(k) does not act freely. The maximally degenerate situation is when the 
whole of U(fc) is fixed. This occur at the apex of the hyper-Kahler cone and describes a configuration 
where all the instantons have shrunk to zero size at the same point in IR 4 . 

In a certain mathematical sense, the moduli space Wl k excludes the regions with point-like instan- 
tons. It then has a natural compactification by including the regions with point-like instantons at the 

16 This is the product (E 4 ) r modded out by the group of permutations on r objects arising from the subgroup of the 
Weyl group of U(/c) that permutes the r labels {i} for which w U ia 0. 
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boundary [29] (see also [6]). However, in the semi-classical approximation of the functional integral, 
the singularities of the instanton moduli space do not lead to a divergence of the collective coordinate 
integral and — for all practical purposes — we need not distinguish between DJlk and its compactification. 
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Ill The Collective Coordinate Integral 



In this section, we describe how instantons contribute to the functional integral of the field theory in the 
semi-classical limit. To start with in §111.1 we consider the problem of expanding around an instanton 
solution in the functional integral. We follow the approach of Bernard [23] suitably generalized in 
an obvious way to k > 1 (see also the thorough treatment in Osborn's review [8]). In particular, 
following [23], we take advantage of certain simplifications that occur in singular gauge arising from 
the fast fall off of the gauge potential. The goal of this section is to show that the leading-order semi- 
classical approximation involves an integral over the collective coordinates with a measure defined by 
the inner product of the corresponding zero modes along with a ratio of determinants arising from the 
fluctuations around the instanton. In §111.2 we explain how a concrete expression for the integration 
measure can be obtained using the hyper-Kahler quotient perspective, although this was not originally 
how the measure was found in Refs. [30,31]. We go on in §111.3 to review the successes and failures of the 
old instanton literature regarding the fluctuation determinants in the general k instanton background. 
We do not develop this subject any further because the ratio of fluctuations in a supersymmetric theory 
is a trivial collective coordinate independent constant factor. 



III.l From the functional to the collective coordinate integral 

The semi-classical approximation is a saddle-point method and, as such, in order to find the leading 
order behaviour of the functional integral, we need to consider the fluctuations around the instanton 
solution. To this end, we expand 

A n {x) = A n (x; X) + 8A n {x- X) , (3.1) 

where A n (x;X) is the instanton solution as a function of the collective coordinates and 5A n (x;X) are 
the fluctuations which are chosen to satisfy the background gauge condition (2.24) 17 

V n 5A n = . (3.2) 

Notice that the fluctuations depend implicitly on the collective coordinates. The action of the theory 
to quadratic order in the fluctuations is (2.27) 

S = -2rrikr -^Jd 4 x tr N 5 A &a A {+) J 5 A pdl + ■■■ . (3.3) 

We must also include the gauge-fixing term involving the ghost fields 

S g{ = 2 J d 4 x tr N bV 2 c . (3.4) 

The fluctuations can be expanded in terms of the eigenfunctions of the operator which can be 
split into the zero modes, already extensively investigated in §11.5, and the non-zero modes: 

5A n = J2e^A n + A n . (3.5) 



17 



In the following all covariant derivatives are defined with respect to the instanton solution A n (x; X). 
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Here, the non-zero mode fluctuations A n are functionally orthogonal to the zero modes. In the functional 
integral we now separate out the integrals over the zero and non-zero modes: 

J [dA n ]=g- 4kN J {Vd^gJX) H^=} ( 3 - 6 ) 

Here, g(X) is the metric on the zero modes defined in (2.32). The term in braces is the integral over 
the zero mode subspace. The factors of g in front of the measure arise from the fact that we included 
a factor of g 2 in the definition of the metric (2.32). 

The non-zero mode fluctuations A n and the ghosts can now be integrated out, producing the usual 
determinant factors: 

J [dA n ] [db] [dc] exp l - J d 4 xtr N {jW^ - AbV 2 c) = • (3-7) 

As is conventional, the prime on the determinant indicates that the operator A^ + ) has zero modes and 
these must be excluded in the product over eigenvalues that defines the determinant. So the leading 
order expression for the functional integral in the charge-Zc sector is 

At this stage, (3.8) is only schematic because in order to define the determinants rigorously, we must 
regularize the theory in some way. We shall consider this problem in §111.3. 

In the final part of this section we explain how the integrals over the expansion coefficients of the 
zero modes may be traded for integrals over the collective coordinates {X^}. This change of 
variables is facilitated by the well-known trick of inserting an expression for unity in the guise of 

1 ee J \{dX» |det^| Y[S(f v (X)) . (3.9) 
A judicious choice for the function j v {X) is the inner product of the fluctuation (3.1) with the zero 

f„(X) = -2g 2 I d i xtr N 5A n 5 v A n = ^"g^X) , (3.10) 

where we used (3.5) and the fact that the zero and non-zero modes are functionally orthogonal, along 
with the definition of the metric g /J , l/ (X) as the inner product of zero modes (2.32). 

Now we compute the derivative 

dx^ = ~ 29 J dxtTN \^ 6 » An + 6An ^x^\ ■ (3 - n) 

Since the total field A n (x) in Eq. (3.1) does not depend on the collective coordinates, we can replace 
5A n by — A n (x; X) in the first term. Using the fact that 

J d A x tr N VSt^yAn = - J d A x tr N Q^T> n S u A n = , (3.12) 
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mode 5 u A n : 



where the surface term vanishes due to the asymptotic form of f2 M and 5 u A n (see §11.5), and the last 
equality follows from (2.24), we have 

-2? J d 4 xtr N 9A j^ X) 5 v A n = g, u {X) . (3.13) 

The insertion of unity is then 

1 = J ]JdX» jdet (g^X) -2g 2 J d 4 xtr N SA n ^^-)\ ^(E^PO) ■ (3-14) 

Plugging this into (3.8), we perform the integrals over the expansion coefficients using the 5-functions. 
Since g^X) is invertible the 5-functions enforce ^ = 0. Consequently in the second term of the 
determinant we can replace 5A n by the non-zero mode part of the fluctuation A n . This term is higher 
order in g and may be dropped to leading order. Finally we have to leading order in in the charge-Zc 
sector 

/ [dA n ] [db] [dc] e - s[A ' b ' c] [ u ■ d f fe^ . (3.15) 

Notice that the resulting form of the collective coordinate integral is proportional to the canonical 
volume form on the moduli space Tlk associated to the metric g^ u (X): 

I «= / ( 3 - ie ) 

Jm k J " v2tt 

along with a non-trivial function on QJtfc equal to the determinants of the operators governing the 
Gaussian fluctuations of the gauge field and ghosts in the instanton background (3.7). 

Finally, when calculating a correlation function {Oi(x^) ■ ■ ■ O^x^)) in the semi-classical limit, the 
fields insertions Oi(x^) are replaced by their values in the instanton background and so, to leading 
semi-classical order, become functions of the collective coordinates. 



III. 2 The volume form on the instanton moduli space 

We have shown that the leading order semi-classical approximation of the functional integral involves an 
integral over instanton moduli space with a measure associated to the natural metric inherited from the 
inner product of the zero modes (2.32). At leading order in the saddle-point expansion, the integrand 
includes determinants of the fluctuations (3.15), but in the present section we focus on the volume form. 

The most direct way to obtain a workable expression for the volume form on the instanton moduli 
space is to use the fact that this space can be described as a hyper-Kahler quotient of flat space. 
Actually this is not how the volume form was originally found in Refs. [20,30,31]. In these references, 
the constraints of supersymmetry, the Index Theorem and other consistency conditions (most notably 
"clustering" in the dilute instanton gas limit) were used to write down the unique expression for the 
volume form for a supersymmetric gauge theory. Then, decoupling the fermions and scalars, by giving 
them large masses, an expression for the volume form on 9Jt fc was derived. Here, we will not follow this 
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route, rather we shall take as our central theme the hyper-Kahler quotient construction (an approach 
also adopted in [32]). Since the quotient space DJlk inherits a metric from the mother space 971 it also 
inherits a volume form. Recall from §11.4.1 and Appendix B, there are two parts to the hyper-Kahler 
quotient. Firstly, one restricts to the level sets 91 C 97T defined by the vanishing of the moment maps 
\x = 0. Then one performs a quotient by the group G finally giving DJl k = %l/G. An expression for the 
volume form on 97tfc, is be obtained from that of 97t by imposing the vanishing of the moment maps by 
explicit 5-functions with the appropriate Jacobian. One then divides by the volume of the orbits of the 
G-symmetry. This leads in general to the expression (B.33). An expression for the volume form on Wl k 
is then obtained by "gauge fixing" the U(k) symmetry. However, it turns out to be more convenient 
to leave the G-symmetry "un-fixed" . This is because the expressions for all the fields in an instanton 
background are G-invariant. 

For the ADHM construction the mother space is 971 = ^ 4k ( N + k ) parameterized by with metric 
(2.66) and G is U(k) acting in (2.64). The moment maps are the ADHM constraints (2.59a). The U(fc) 
action defines a set of vector fields on 971, X r , r = 1, . . . , k 2 , given in (2.69). The inner-product of these 
vector fields is given by (2.124) which defines the ADHM operator L which appears in the formula for 
the volume form. The volume form of the ADHM moduli space in its U(/c)-unfixed form is, from (B.40), 

J^ = J d^a \det k ,L\ J] f[ ^tr^rVa*)) ■ ( 3 ' 17 ) 

In the above, we have defined 

4 k 2 k N 2 

f d 4k ^a = / n n d ^ r n n n ^ , 

n=l r=l i=l u=l a=l 

where the integral over the k x k matrices a' n and the arguments of the ADHM constraints are defined 
with respect to the generators of U(fc) in the fundamental representation, normalized so that tr k T r T s = 
5 rs . The volume of the U(fc) is the constant 

Ok fe(fe+l)/2 

VolU(fc)= . (3.19) 

rii=i ^ 

The normalization factor C k can be determined by taking into account the normalization of the metric 
(2.66), giving 

C k = 2 - fe ( fe - 1 )/ 2 (27r) 2fc7V . (3.20) 

Our expression for the volume form on the instanton moduli space can be compared with the one- 
instanton expressions in the literature [2,23]. For k — 1 the ADHM are resolved as described in §11.4.3. 
In fact the necessary change of variables from w U a to the scale size p and gauge orientation XL is described 
in §VI.l. In particular, the (non-supersymmetric) one-instanton measure is given by (6.29) with M = 0, 
k = 1 and with W° = 2p 2 and a' n = —X n . In order to compare with the literature we integrate over 
the gauge orientation 11 which is conveniently normalized so that J dU = 1. This leaves 

u = f J / d 4 Xdpp AN ~ 5 , (3.21) 



/ 



l m (N-l)\(N-2)\ 
which agrees with the expression for gauge group SU(A^) derived in [23] 
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III. 2.1 Clustering 



The relative normalization constants C^, for different k, can be checked by using the clustering property 
of the instanton measure. This requires in certain regions of moduli space where a k instanton config- 
uration can be interpreted as well separated k x and k 2 instanton configurations, 971^ is approximately 
DJtfci x 9Jtfc 2 , and so the volume form factorizes as 

h\k 2 \ 



/ uj x / u> . (3.22) 
Jm k , Jm kn 



<m k k\ Jyji kl J<m k2 

We can determine the overall normalization of the volume form by going to the completely clustered 
limit defined in §11.4.3. In this limit, we can describe the positions of the single instanton by the diagonal 
elements of a' n , (a' n )u = —X l n . Part of the U(k) symmetry can be fixed by setting to zero the off-diagonal 
elements of a' n that are generated by U(k) adjoint action on the diagonal matrix diag(— X*, • • • , —X*); 
as in §11.4.3 we will denote the remaining off-diagonal elements as a!^. In the measure, this gauge fixing 
involves a Jacobian factor: 18 

2fc(fc-l)/2 1 



VolU(fc) 



The off-diagonal ADHM constraints are linear in a!^ (2.101) and the corresponding 5-functions in 
(3.17) can be used to integrate out completely the 3k(k — 1) variables a! n . These integrals produce a 
factor n^j(^ 1 — X j )~ 3 . To complete the analysis we note that in the complete clustering limit 

k 

detL = ]j2p 2 i ■ \{{X l - X 3 ) 2 + ■ ■ ■ (3.24) 

i=l i^j 

where 2p 2 = L iy the i-operator of each individual instanton. 

Putting everything together one finds in the complete clustering limit, with the normalization con- 
stant given in (3.20), 

/ uj — > — u> x • • • x / w , (3.25) 
Jm k k\ J mi Jmi 

k times 

as required. 



III. 3 Fluctuation determinants in the instanton background 



Determinants of fluctuation operators in the background of a single instanton were first evaluated by 
't Hooft in the classic paper [2]. In this section, we sketch how the determinants of the fluctuation 

18 The following formula can easily be derived from the well-known Jacobian that arises from changing variables from 
the elements of a k x k Hermitian matrix X (defined with respect to our basis T r ) to its eigenvalues X 1 : J dX = 
(27r) fc ( fc - 1 )/ 2 /nti* ! ' J dX * II, .,' A ' -V • '. See for example [33]. 
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operators in a general ADHM instanton background can be found, although we omit many of the 
technical parts of the calculation, principally because in the supersymmetric case the determinants over 
non-zero modes cancel between the bosonic and fermionic fields [34]. A complete review of the old 
instanton literature pertaining to the fluctuation determinants may be found in Osborn's Review [8]. 
Ideally, one would want to express the results in terms of geometric quantities on the instanton moduli 
space. This was never achieved and the final answers are quite implicit, involving spacetime integrals. 

The first thing we need to do is to provide properly UV-regularized definitions of the determinants. 
We will do this by introducing Pauli-Villars regulator fields with large masses ^ and alternating "metric" 
ej, such that 

V V 

J> = -1, ]Te^ p = p = l,...,v-l . (3.26) 
i=i i=i 

We will also define 

V 

log// = -J^eilog/Xi . (3.27) 

i=i 

For a consistent regularization the number of regulator fields v must exceed three. The regularized 
determinants are 

V 

log det'A W = Tr | log(AW + %) + ^ e t log(A« + l%) } , (3.28a) 

i=l 

v 

log det A (_) = Tr | log A ( ~ } + ^ e, log(A ( - } + // 2 ) } . (3.28b) 

i=i 

In (3.28a), 7q is the projector onto the zero mode subspace of A^ + ^. Since the non-zero eigenvalues of 
A(±) are identical; hence, we can extract a very simple expression for the ratio 

= exp (2kN £ e^) = ^ kN , (3.29) 
i=i 

Here, 2kN is the number of zero modes of A^ + ) and jx is the overall Pauli-Villars mass scale (3.27). Using 
this relation and the fact that det A^ = {det(-D 2 )} 2 we can express the fluctuation determinants in 
(3.15) in terms of the determinant of the covariant Laplacian: 

det(-X> 2 ) _ , AkNA ^, _ w 



det'AM 



^det(-V 2 )- . (3.30) 



We have succeeded in reducing the problem to that of the fluctuation determinant of a scalar field 
transforming in the adjoint representation of the gauge group. By pooling together results from the 
old instanton literature we can find an expression for this determinant. Firstly, one can construct 
the determinant for a scalar field transforming in the fundamental representation of SU(iV) following 
Refs. [8,35,36]. The determinant for an adjoint- valued field is then related to this by a remarkable 
formula of Jack [37]. 

So we begin with a brief sketch of how one determines the fluctuation determinant of a scalar field 
transforming in the fundamental representation. The key idea involves considering the variation of 
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logdet(— ^ ,2 )fund. by the collective coordinates. 19 The resulting formula established in [8,35,36] is 

d 



^-logdet(-£> 2 ) fund . = J (PxtiNd^AnJn , (3.31) 

where J n is the conserved current 

J n = Ua na6t b a fK^fbpU - Wfb^fKa^U (3.32) 

and S^A n is the zero mode associated to X M . By integrating this expression one can extract the ratio of 
the determinant in the instanton background to the determinant in the vacuum. The final expression 
established in [35] is expressed solely in terms of the ADHM matrix /: 

rdetf— T> 2 )i k If 

lo Hdsh^]^. = 6 log,,+K + s^/ A(/lW + /2W) ' (3 33) 

where V is the covariant derivative in the vacuum (A n = 0), p k is a constant which will be determined 
shortly and the integrands are 

Ak 

h(x) = tT k (fd n r i fd n r i fd m r i fd m r i - 20/ 2 ) + {1+x2)2 , (3.34a) 

I 2 {x)= [ dte^tikifdJ^fdJ^fXf-'mr'fdJ- 1 ) . (3.34b) 
Jo 

Here, t is an auxiliary variable and f(x,t) is the k x k dimensional matrix derived from f(x): 

f-\x,t) = tf- l (x) + (1 - t)(l + x 2 )l [k]x[k] . (3.35) 

In order to fix the constant p k we can appeal to a clustering argument. In the complete clustering 
limit the k x k matrix / is diagonal: 

^^-■ (x-^^ '- 1 ' (3 ' 36) 

where pi and X l n are the scale size and position of the i th instanton, respectively. In this limit the 
integrals in (3.33) can easily be evaluated. (Actually the integral of I2 vanishes.) One finds 

iog [SS]-. = ^ (|iog(m) -* )+pt - (3 - 37) 

Comparing with the one instanton expression for this determinant [2] fixes the constant to be 

p k = k(a{\) + f- 8 ) = k(- 2C'(-1) - J log 2 + i) . (3.38) 
Here, a(\) is a constant defined in [2]. 

The second part of the problem involves relating the fluctuation determinant associated to the 
fundamental representation of the gauge group to that of the adjoint representation. The way this was 



19 



Here, the subscript reminds us that the scalar field transforms in the fundamental representation of SU(iV). 
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achieved [37] relied on the tensor product ADHM formalism developed in [18]. The analysis provides the 
following explicit relation for the fluctuation determinant of the Laplace operator for an adjoint- valued 
field in terms of one for a fundamental-valued field: 

, det(~V 2 ) oArl rdet(-P 2 )i 

log: - = 2iVloe: 

g det(-D 2 ) g [det(-P 2 )J fU nd. (33g) 

+ logdet fe2 X - ^jL J dx 4 logdet fc / D 2 logdet fc / + q k . 

Here, L is the ubiquitous ADHM operator on k x k matrices that we defined in (2.125). As above, we 
will determine the constant q k by a clustering argument. In the complete clustering limit, we find 

l0g del(-Pg) = 2jV £ (J hgim) + a(1 ^ + Hhg2 ~« ) + qk - (3 ' 40) 
Comparing this with the one instanton expression [2], we find 

q k = k{a(l) - 4a(|) - log 2 + §) = \k . (3.41) 
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IV Instantons in Supersymmetric Gauge Theories 



In this Chapter we consider instanton configurations in gauge theories with supersymmetry. There is an 
intimate relation between instantons and supersymmetry which can be traced to the fact that, in any 
supersymmetric gauge theory, self-dual gauge fields are invariant under precisely half the supersymmetry 
generators. As we will see, this is manifest in the ADHM construction, which has a very natural 
supersymmetric generalization. Specifically we consider the minimal theories in four dimensions with 
gauge group SU(iV) and M = 1, 2 and 4 supersymmetry. Like the purely bosonic gauge theory 
considered in previous chapters, the M = 1 and M = 2 theories are asymptotically free and the 
relevance of instantons in the quantum theory is not immediately obvious. However, we will begin by 
focusing on the classical aspects of these configurations. 

Before searching for supersymmetric instantons there is the worrisome issue of supersymmetry in 
Euclidean space to discuss. The nub of the issue is the following: Weyl spinors in D = 4 Minkowski 
space are in a "real" representation of the covering group of the Lorentz group SO(3, l). 20 Consequently 
the minimal spinor in D = 4 is a Majorana spinor, which we can think of as two Weyl spinors A and A 
subject to the reality condition 

A d = (A Q )t (a = a) . (4.1) 

This ensures, for instance, that the canonical fermion kinetic term is real. On the contrary, in D = 4 
Euclidean space, Weyl spinors are in "pseudo real" representations of SO (4) and one cannot impose 
the reality condition (4.1). On the contrary, SO (4) ~ SU(2) x SU(2) so the spinor indices a and a 
refer to each of the SU(2)'s and they are not mixed under complex conjugation. In Euclidean space 
there is no notion of a Majorana spinor and, apparently, no Euclidean version of the theory with a real 
action. This problem may be by-passed in theories with extended supersymmetry as the Weyl spinors 
in these theories may be combined in pairs to form Dirac spinors which do have Euclidean counterparts. 
However, the problem is unavoidable for theories with Af = 1 supersymmetry. 

There are several alternative approaches to this issue favoured by different workers in the field. The 
most conservative approach as described in [38] (See Appendix A of this reference), is to abandon the 
idea of constructing a Euclidean quantum field theory with J\f = 1 supersymmetric as unnecessary. In 
particular, our main interest should be calculating Green's functions in the Minkowski space theory. 
As in standard perturbative calculations, these Green's functions are most conveniently calculated 
by analytic continuation of the corresponding Minkowski space path integral to Euclidean spacetime. 
This is effected by the standard Wick rotation of the time coordinate x — > — ix^. The path-integral 
exponent is then —Se = iSm- The resulting path-integral can then be evaluated in the saddle-point 
approximation by expanding in fluctuations around the minima of Se- This procedure yields finite and 
well-defined answers for the original Minkowski space Green's functions. In this context, the fact that 
the fermionic part of Se is not real is inconsequential: it does not affect the convergence of the integrals. 
Some authors argue that reality of the action is not, in any case, the appropriate condition for Euclidean 
space theories. Rather we should impose a modified condition known as "reflection positivity" which is 

20 Our conventions for spinors are described in Appendix A. In particular in Minkowski space our conventions are those 
of Wess and Bagger. 
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characteristic of fermionic actions on a spacetime lattice. 



In the following we will adopt the conservative viewpoint described above. However, there are other 
approaches to instantons which divorce themselves from the Minkowski space theory and seek to define 
the supersymmetric theory directly in Euclidean space [13,39]. This follows work of several authors who 
have shown that it is actually possible to define Euclidean versions of the theories with both extended 
supersymmetry and real actions [40-44]. These theories have the potentially undesirable feature of 
a non-compact i?-symmetry group and a scalar field with a negative kinetic term. The philosophy 
is therefore slightly different from our viewpoint — and the relation with the original Minkowski space 
theory is now rather obscure — but the resulting calculations are essentially identical to those described 
below. 

In the bulk of this chapter we will discuss the M = 1, 2 and 4 theories in a unified formalism. In the 
remainder of this introductory section we will introduce some of the key points which arise, starting with 
the M = 1 theory which contains the gauge field and a single species of Weyl fermion X a in the adjoint 
representation of the gauge group. Setting the fermion fields to zero to start with, the ADHM instanton 
configuration trivially solves the equations-of-motion. However, must now consider the fluctuations of 
the fermions around this solution. These are governed by the following covariant Dirac equations, 



where the covariant derivatives are taken in the adjoint representation and evaluated in the instanton 
background of topological charge k. As already discussed in §11.3, a standard application of the Atiyah- 
Singer index theorem shows that equation (4.2a) has 2kN linearly independent (normalizable) solutions. 
These were identified explicitly in §11.5 as A a (C{), were i = l,...2kN labels the solutions of the 
constraints (2.109a) and (2.109b). On the contrary, (4.2b) has no non-trivial solutions in an instanton 
background. 

These solutions are known as the fermion zero modes of the instanton. There are two distinct — but 
ultimately equivalent — approaches to treating these modes. One way of incorporating these modes is 
to treat them perturbatively as fluctuations around the ordinary bosonic instanton. This expansion 
is perfectly consistent, but is hard to carry out in practice beyond the lowest orders. An alternative 
approach, and the one we will adopt here, originates in the work of Novikov et al [38]. In this approach, 
we interpret the fermionic zero modes as corresponding to a degeneracy of classical solutions just like 
zero modes of the gauge field we met in the previous chapter. More generally, the standard semi-classical 
reasoning suggests that we should look for finite-action solutions of the full coupled equations-of-motion 
of the supersymmetric theory. We will call these configurations super-instantons. In the present case 
they are easy to find: the gauge field takes is ADHM value, while the fermions solve equations (4.2a) 
and (4.2b). Thus the right-handed fermion is zero; this ensures that the fermions do not modify 
the Yang-Mills equation for the gauge field. Meanwhile, the left-handed fermion X a is general linear 
superposition of the normalizable zero modes: 



Tp\ A = , 
Tp\ A = 0, 



(4.2a) 
(4.2b) 




(4.3) 



i=i 
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As X a is a fermionic field, the 2k N coefficients ip 1 are Grassmann variables. In fact (4.3) can be written 
more compactly as X a = A a (Ai), where the Grassmann quantities M. = i\?C\ themselves satisfy the 
constraints (2.109a) and (2.109b). We will call these constraints the "fermionic ADHM constraints" 
since they will turn out to be the Grassmann superpartners of the "bosonic" ADHM constraints (2.65). 
The solutions to these constraints will then be parameterized as M.^ 1 )- 

The Grassmann variables ip 1 are the fermionic analogs of the collective coordinates which parame- 
terize the general ADHM solution and we will henceforth refer to them as Grassmann collective coordi- 
nates. While the bosonic collective coordinates X M have the interpretation as intrinsic coordinates on 
the moduli space 9Jtfc, their Grassmann counterparts correspond to intrinsic symplectic tangent vectors 
on this manifold (see Appendix B). Like the bosonic collective coordinates they parameterize degen- 
erate minima of the action and we must integrate over them. We will work out the exact integration 
measure in the following but the important qualitative features follow from the basic rules of Grassmann 
integration, 

r dtl? = , / dip ft = <5 1J . (4.4) 



We see that to obtain a non-zero answer, each Grassmann integration in the measure must be saturated 
by a single power of the integration variable. This leads to simple selection rule for fermionic Green's 
functions in the k instanton background. Specifically the Green's function, (\( y x 1 )\(x 2 ) . . . A(xj)) will 
vanish unless I = 2kN. This counting can be understood by noting that N = 1 supersymmetric Yang- 
Mills theory has an anomalous abelian i?-symmetry under which the left- and right-handed fermions 
have charges ±1. The exact anomaly in the i?-symmetry current jj? is determined at one loop as, 

d ™j* = ^ tr^v F mn *F mn (4.5) 

Integrating this equation over spacetime we find that conservation of the corresponding U(l) charge is 
violated by 2kN units in the background of topological charge k. which agrees with the selection rule 
given above. This agrees with the selection rule described above. The symmetry means that observables 
of definite i?-charge receive corrections from a single topological sector if at all. As we will see below, 
much more interesting behaviour is possible in theories without an anomalous abelian i?-symmetry. 

In the previous chapter we saw that a subset of the bosonic collective coordinates can be understood 
in terms of the action of symmetries of the theory on the instanton solution. The same is true for 
the Grassmann collective coordinates. As mentioned above, any self-dual configuration of the gauge 
field is invariant under half the generators of the supersymmetry algebra. On the other hand, the 
other half of the generators act non-trivially on the solution generating fermion zero modes. The 
J\f = 1 supersymmetry algebra has four supercharges and yields two zero modes. In fact, classical 
supersymmetric gauge theories in four dimensions are invariant under a larger superconformal algebra 
which, in the M — 1 case, includes four additional fermionic generators, two of which act non-trivially 
on the instanton. Broken symmetries thus yield a total of four fermion zero modes. The argument 
which guarantees the existence of these zero modes is very robust (it is essentially Goldstone's theorem) 
and holds at all orders in the full quantum theory. As above, the actual number of zero modes is 2kN 
which equals four in the minimal case k — 1, N — 2. Thus, the situation for fermionic zero modes 
matches nicely with the corresponding counting of bosonic zero modes given in the previous chapter. 
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Specifically, the bosonic and fermionic zero modes of a single instanton of gauge group SU(2) are all 
associated with the action of broken symmetry generators. For higher instanton number and/or larger 
gauge group there are additional zero modes which do not correspond to broken symmetries. 

Theories with extended supersymmetry or, more generally, M = 1 theories with additional matter 
necessarily contain scalar fields. As usual, the scalars can acquire vacuum expectation values (VEVs) 
which spontaneously break the gauge symmetry. This possibility is particularly important for applica- 
tions of instanton calculus to theories with asymptotic freedom. As usual such theories are characterized 
by logarithmic running of the coupling which introduces a dynamical scale A. In the absence of scalar 
VEVs, the running coupling becomes large in the IR at mass scales of order A and we do not expect 
semi-classical methods to work. However, if we introduce a scalar VEV which breaks the gauge group 
at some scale v then the effective coupling will not run below this scale. In particular, if the scale of the 
VEV is much greater than A, the running coupling is frozen before it has a chance to become large and 
the theory is weakly coupled at all length scales. In these circumstances we can expect a semi-classical 
analysis of the path integral to be reliable. 

Introducing scalar fields, with or without VEVs, affects the instanton calculus in several important 
ways. In addition to the gauge couplings of the scalars, supersymmetric Lagrangians necessarily contain 
Yukawa couplings between the scalars and fermions. As above, we are looking for a super-instanton 
which solves the full equations of motion of the theory. A promising starting point is the configuration 
described above where the gauge-field takes its ADHM value and the fermions are a general linear 
combination of the zero-modes. The scalars themselves satisfy a covariant Laplace equation in the 
gauge field background with a fermion bilinear source term which comes from the Yukawa coupling 
and, as we review below, this equation can be solved explicitly. The solution exhibits a new feature 
of the super-instanton: bosonic fields can have pieces which are bilinear (or of higher even power) in 
the Grassmann collective coordinates. A potentially worrying feature is that a complex scalar field 
typically acquires a non-zero Grassmann bilinear part, while its complex conjugate does not. There 
is no inconsistency here, as we can illustrate by considering the following toy integral: 



d^e-W +tW , (4.6) 

where A and B are quadratic expressions in Grassmann variables and we suppose A ^ B*. Obviously, 
one should expand in the Grassmann composites A and B, since the series terminates because there 
are only a finite number of Grassmann variables, and then do the integral: 

p p 

I = V -^—A m B n [ d 2 (f) m 0* n e~ w2 = 7T V ^-(AB) m = ne AB . (4.7) 
^ m!n! J m\ 

m,n=0 rn=0 

However, the same result can be obtained by solving the "equations-of-motion" of and 0* and shifting 
the integration variables by the solution. The solutions are 

= 5, 0* = A , (4.8) 
which, since by hypothesis A^ B* , violate the reality condition on 0. However, 

J = J d 2 <Pe-^- B ^^ +AB = e AB J d^e^ 2 = ne AB (4.9) 
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reproducing (4.7). In this way solving the equation-of-motion in the instanton background will involve 
Grassmann composite terms which violate the reality conditions of the fields, however, as we have seen 
with the toy integral above, it is a convenient book-keeping device. 

Even in the absence of scalar VEVs, the configuration described above is not the end of the story 
because it does not necessarily solve the full coupled equations-of-motion. In particular, the non-zero 
Grassmann bilinear part of the scalar field can modify the equations of motion for the fermions and even 
for the gauge field itself, invalidating our starting ansatz for these fields. In general this modification 
will be non-trivial unless it is forbidden by the symmetries of the theory. For theories with an abelian 
i?-symmetry, such as the Af = 2 theory without scalar VEVs, the new terms in the equations are zero 
and our candidate super-instanton actually solves the full equations of motion. However, in one of 
the most important cases, that of Af = 4 supersymmetric Yang-Mills, there is no such symmetry and 
the modification is unavoidable. Even in this case the situation is not as hopeless as it might appear 
because the offending couplings are each suppressed by powers of g 2 at weak coupling. Fortunately, for 
our stated purpose of calculating the leading semi-classical contributions to Green's functions, it suffices 
to solve the equations of motion perturbatively to some order in g 2 . The resulting configuration differs 
from being an exact solution by a power of g 2 and hence we will refer to it as a quasi-instanton. The 
question of whether a corresponding exact solution exists is an interesting one. As we are expanding in 
Grassmann bilinears, the perturbation series must truncate at some finite order and one might imagine 
that the final result should be an exact solution. However, at least for the Af = 4 theory we will 
argue that this is not the case. In fact, we will exhibit an obstruction to solving the equations beyond 
next-to-leading order in this case. In any case, the utility of proceeding beyond the first few orders in 
the classical equations is unclear, because one must also take into account quantum corrections which 
modify the equations themselves at the same order in g 2 . 

An important property of the quasi-instanton can be exhibited by calculating its action. The Af = 4 
theory has four species of Weyl fermions which lie in the fundamental representation of an SU (4) R- 
symmetry. The corresponding Grassmann collective coordinates are ip lA where, as in the Af = 1 theory 
discussed above, i = 1, . . . , 2kN and A = 1, 2, 3, 4 is the _R-symmetry index. In the absence of scalar 
VEVs, the action of our quasi-instanton of topological charge k is, 

Rir 2 k 1 

S = — ~ ik6 + ^abcdR^{XW a ^ b ^ c ^ d . (4.10) 

Here, R(X) is the symplectic curvature of the hyper-Kahler quotient space DJlk, which depends explicitly 
the chosen point in the moduli space. The action depends explicitly on both the bosonic and fermionic 
collective coordinate reflecting the fact that we are not dealing with an exact solution of the equation 
of motion. 

From the point of view of the semi-classical approximation, the expression (4.10) can be thought of as 
an "effective action" for the quasi- zero modes, to order g° in the coupling, which results from integrating 
out the remaining modes of non-zero frequency. As we will see in the next Chapter, the exponential 
of this action is an essential ingredient in the collective coordinate integration measure. The effective 
action for the instanton collective coordinates closely related to the idea of a world-volume effective 
action for solitons and other extended object in higher dimensions. This point of view will be developed 
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substantially in Chapter X. 

The dependence of the action (4.10), on the Grassmann collective coordinates means that the cor- 
responding fermionic modes are not exact zero modes. These modes are lifted at order g° in the 
semi-classical expansion. The zero modes which are generated by the action of fermionic symmetry 
generators on the instanton are an important exception. The same symmetries lead to zero eigenval- 
ues of the curvature tensor which mean that the action (4.10) is independent of the corresponding 
Grassmann collective coordinates. The associated zero modes remain unlifted to all orders in g 2 . The 
lifting of fermion zero modes means that the selection rules determining which Green's functions can 
receive instanton corrections are much less restrictive than those of the N = 1 theory described above. 
Typically this is also related to the absence of an anomalous abelian _R-symmetry. For example, in the 
N = 4 theory with zero scalar VEVs, the total number of exact fermion zero modes is sixteen (two 
supersymmetric and two superconformal modes for each of the four species of Weyl fermion). This 
number does not depend on the topological charge and, for example, sixteen-fermion correlators receive 
an infinite series of corrections from all numbers of instantons. 

Finally we turn to the case where scalar fields develop non-zero VEVs. In this case, it is well known 
that there is no non-trivial instanton solution of the coupled equations of motion for the gauge field and 
scalar. Indeed, the existence of such a solution is forbidden by Derrick's theorem. The problem is best 
illustrated by proceeding naively in the SU(2) theory. If we solve the covariant Laplace equation for an 
adjoint scalar with VEV diag(0°, —0°) in the background of a single SU(2) instanton of scale-size p 
and substitute the solution back into the action the result is, 

S = % - id + 4ttVW ■ (4- 11 ) 

The instanton action depends explicitly on the scale size and may be lowered continuously to zero by 
shrinking the instanton. However, it is useful to note that the second term in the action (4.11) is down 
by a power of g 2 relative to the first. 21 Thus our candidate configuration is actually a quasi-instanton in 
the same sense as in our discussion of the Af = 4 theory above. In either case, our aim is to determine 
the leading semi-classical behaviour of Green's functions and it is legitimate to solve the saddle-point 
conditions order by order in g 2 . 

In the case of non-zero scalar VEVs, this approach was first developed by Affleck who referred to the 
corresponding field configurations as constrained instantons . The formalism of constrained instantons 
is rather technical. However, as we explain below, a large part of it can actually be understood in 
terms of a modified instanton action like that given in (4.11). As mentioned above, any quasi-instanton 
will have an action which depends explicitly on the collective coordinates. In the case of the M = 4 
theory without scalar VEVs, the resulting expression, given as (4.10) above, is determined in terms of 
the symplectic curvature on 9Jl k . One of our main results is that the effect of introducing scalar VEVs 
is simply to introduce an appropriate potential on the moduli space. Furthermore this potential has a 
nice interpretation as the norm squared of a tri-holomorphic Killing vector on 9Jt fc (for related results 

21 Strictly speaking this is true as long as we treat p<j>° as order g°. As we must integrate overall values of p this 
assumption needs to be justified. In the cases of interest is not hard to show that that larger values of pcf> are exponentially 
suppressed and the naive scaling is correct. 
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in the context of dyons see [45,46]). 

The remainder of this Chapter is organized as follows. In §IV.l, we present the Minkowski space 
action, equations-of-motion and supersymmetric transformations of the minimal gauge theories with 
M = 1, 2 and 4 supersymmetries in four dimensions. We discuss the analytic continuation of the theory 
to Euclidean spacetime. In §IV.2 we construct the super-instanton at the first non-trivial order in g 2 . 
This necessitates solving the adjoint Dirac equation (4.2b) for the left-handed fermion in the general 
ADHM background. The general solution involves a matrix of Grassmann variables constrained by a 
linear equation which generalizes the ADHM constraint equations of the previous chapter. We introduce 
intrinsic Grassmann collective coordinates and identify them as symplectic tangent vectors on VJl k . We 
discuss the action of supersymmetry on the collective coordinates. 

In §IV.3, we consider the construction of the super-instanton beyond linear order. This requires 
solving the covariant Laplace equation for the adjoint scalar in the general ADHM background with 
an appropriate fermion bilinear source term. We show that this yields an exact super-instanton for the 
J\f = 2 theory (without scalar VEVs), but only a quasi-instanton, in the sense described above, in the 
M = 4 case. We demonstrate an obstruction to the existence of an exact solution in this case. In §IV.4, 
we review the necessary aspects of the constrained instanton formalism. Finally, in §IV.5 we explain 
how the supersymmetry of the field theory descends to the collective coordinates. 

In the next chapter, we derive the instanton measure in supersymmetric theories. We introduce the 
instanton effective action for these theories and discuss the consequent lifting of fermion zero modes. 
Then we derive an explicit formulae for the appropriate supersymmetric volume form on 9JT fc . 



IV. 1 Action, supersymmetry and equations-of-motion 

We start by defining theories in four-dimensional Minkowski space with M = 1, 2 and 4 supersymmetry. 
In the interests of brevity we will develop a unified notion that allows us to deal with all these cases 
together. To this end, we introduce the fermionic partners of the gauge field \ A and A^- Here, A = 
1, . . . , M is an /^-symmetry index of the supersymmetry. Since we are working — at least initially — in 
Minkowski space, these spinors are subject to the reality conditions 

(XiV = K A , (Ai)t = A aA (a = a). (4.12) 

In addition, for the theories with extended supersymmetry there are real scalar fields (f> a , a = 1, . . . , 2{M— 
1). The Minkowski space action is 22 

= / d 4 xtv N {\F 2 mn + % lf-F mn *F mn + 2iV n \ A a n \ A - V n ^ a V n ^ a 
J L 16tt 2 ( 4 _ 13 ) 

+ g\ A ^ B [<Pa, X B ] + g\ A ^aAB[^aA B ] + ^a, hf) ■ 

22 Wc remind the reader that our gauge field is anti-Hermitian rather than Hcrmitian, otherwise our conventions in 
Minkowski space are those of Wess and Bagger [47] . 
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The terms involving the scalar fields are, of course, absent in the Af = 1 theory. The E-matrices are 
associated to the SU(2) and SU(4) i?-symmetry group of the Af = 2 and Af = 4 theories, respectively. 
For Af = 2 we take 

In this case the indices A,B, - ■■ = 1,2 are spinor indices of the SU(2) subgroup of the U(l) x SU(2) 
/^-symmetry group. SO in this case, we can raise and lower the indices using the e-tensor in the usual 
way following the conventions of [47] . For the Af = 4 case 



In the above, 
and 



(4.15) 



S a = ( - i] 3 , if] 3 , -if, if] 2 , -rj 1 , if] 1 ) , 
where rf, ff, c = 1,2, 3, are 't Hooft's 77-symbols defined in Appendix A. 

The theory (4.13) is invariant under the on-shell supersymmetry transformations 

5A n = -i A a n \ A - U° n \ A , (4.16a) 

S\ A = - ia ™t A F mn - ig^ ab A B e[<p a , H + EfV&ZVA , (4.16b) 

5\ A = -ia mn l A F mn - igX abA B U<f>a, M + ^ab^V^ , (4.16c) 

<50 a = t^ aAB X B + iU^ AB \ B ■ (4.16d) 



-.ran 1 / _m-n _n-m\ -mn l/=.m_n ^n^tn\ (a i7\ 

a = -^[a o — o o ) , o =-^[00—00). ^4.1/ J 



^ab — |(S a E 6 — S b S a ) , S ab — i(S a S 6 — SbSa) . (4-18) 



In order to construct instanton solutions, we now Wick rotate to Euclidean space. Vector quantities 
in Minkowski space a n = (a , a), with n = 0,1,2,3, become a n = (a,ia°), with n = 1,2,3,4, in 
Euclidean space. The Euclidean action is then —% times the Minkowski space action. The exception to 
this is that we define the Euclidean a-matrices as in (2.8) and (2.9), So in Minkowski space a n = (—1, r) 
and a n = (— 1, — f), whereas in Euclidean space a n = (if, 1) and a n = (— if, 1). Operationally, this 
means that when Wick rotating from Minkowski space to Euclidean space we should actually replace 
the Minkowski space a-matrices by —i times the Euclidean space a-matrices. As usual we treat \ a 
and \ a as independent spinors, i.e. independent integration variables in the functional integral. The 
Euclidean space action is 

S= f d 4 xtr N { - \F 2 mn - ^F mn *F mn - 2V n \ A a n \ A + V n <\> a V n <\> a 

J l 16tH ( 419 ) 

- gX^ifaM - g\ A V aAB [<t>aX] - \g 2 \^M 2 } ■ 

As discussed above, the fact that the fermionic terms in this action are not real will not concern us 
further. For the case with Af = 2 supersymmetry, we can recover the more conventional presentation 
of the theory by defining a complex scalar field 

= 01 - *02 , 4> ] = 01 + #2 • (4.20) 
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and spinors A = A 1 and ip = A 2 . The fields $ = {(f)/ \/2, ip} form a chiral multiplet and V = {A m , A} 
a vector multiplet of Af = 1 supersymmetry. In terms of these variables, the Euclidean space action of 
the M = 2 theory (4.19) is 

S^ =a = / d 4 xtrjv{ - ±F 2 „ - ^LF mn *F mn - 2£> n A> n A - 2Vj)o n ^ + V n (f>*V n <f> 

J I- 16tH ( 42 i) 

+ 2^</#, A] + 2ig[4>\ Aty + ^ 2 [0, f ] 2 } • 

In the following, we prefer the presentation of the theory in (4.19) since this will allow us to deal with 
the theories with different numbers of supersymmetries in a unified way. 

The equations-of- motion following from (4.19) are 

V m F nm = 2g[(f) a , VM + 2ga n {X A , X A } , (4.22a) 
f\ A = gH AB [4> a ,\ B ] , (4.22b) 

t>\ A = gZaAB[<Pa,\ B ] , (4.22 C ) 

V 2 <j) a = g 2 ^, [<j> b , 0J] + gt, aAB \ A \ B + gZ AB \ A \ B ■ (4.22d) 

The supersymmetry transformations in Euclidean space are given by (4.16a)-(4.16d) by replacing the 
sigma matrices with —% times their Euclidean space versions and by replacing Minkowski space inner- 
products with Euclidean ones: 

5A n = ii A a n \ A + iUa n \ A , (4.23a) 
5\ A = w mn i A F mn - ^£ afe V B [0a, H - iVfpfatB , (4.23b) 

5X A = lO mn l A F mn - ig^abA B ^B[4>a, 4>b] ~ ^aAB P^a^ , (4.23c) 

<50 a = it A Y, aAB \ B + iUX AB \ B . (4.23d) 



IV. 2 The super- instant on at linear order 

We will now attempt to find super-instanton configurations which solve the full coupled equations-of- 
motion (4.22a)-(4.22d). First notice that the original instanton solution of the pure gauge theory (2.49) 
is a solution of the full equations-of- motion when all other fields are set to zero. In fact, we can use 
A m (x; X) as a starting point to find the more general solutions where the fermion and scalar fields are 
non-vanishing. As explained in the introduction to this chapter we will proceed perturbatively order 
by order in the coupling. In this connection note the explicit powers of g appearing on the right-hand 
side of Eqs. (4.22a)-(4.22d). 

The first step, following [38], is to expand to linear order in the fields around the bosonic instanton 
solution. To the next order, we must therefore solve the covariant Weyl equations 

f\ A = , (4.24a) 
t>X A = 0, (4.24b) 
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for the fermions, and the covariant Laplace equation 

V 2 (j) a = , (4.25) 

for the scalars. It then remains to be seen whether the original instanton solution needs to be modified 
due the source term on the right-hand side of (4.22a). 

A key result follows from the fact that Tf> has no zero modes in an instanton (rather than anti- 
instanton) background. Consequently, the solution to (4.24b) is A^ = 0. To prove this, (4.24b) implies 
fp\ A = 0. But we can expand the product of operators using the definition of A^ ) in (2.28b), so 

fp\ A = -A ( - } A A = V 2 X A + a mn F mn X A (4.26) 

and use the fact that in an instanton background F mn is self-dual. Since a mn is a projector onto the 
anti-self-dual part (2.12), we have 

PpX A = V 2 X A = . (4.27) 

But V 2 = V n V n is a positive operator. This means T> 2 X A = implies X A = and consequently there 
are no zero modes for the anti-chiral fermions. The discussion above also implies, for vanishing VEVs 
at least, that the scalar fields vanish also vanish at linear order. 

On the contrary, for X A a similar maneuver to (4.26) gives 

TpTpX A = -A«A A = V 2 X A + F mn a mn X A = . (4.28) 

In this case the second term does not vanish and so there is no reason for X A to vanish. In fact, 
we should have anticipated this since, as discussed in §11.5, the Atiyah-Singer index theorem dictates 
that the operator f> has 2kN normalizable zero modes in the k instanton background for gauge group 
SU(iV). 

IV. 2.1 Adjoint fermion zero modes 

In this section, we consider in more detail the adjoint- valued fermion zero modes in the background of 
the bosonic instanton solution; in other words, the solutions of the covariant Weyl equation (4.24a) in 
the ADHM background. Due to the linearity of the equation we can consider a single Weyl fermion A 
with TpX = 0. Fortunately, no addition work need be done since we have already solved this equation 
in §11.5 in the context of the gauge field zero modes. We can immediately write down the solutions in 
terms of the linear functions defined in (2.107): 

X a = g~ 1/2 A a (M) ee g-^fuMfKU - Ub a fMu) . (4.29) 

(The unconventional power of g" 1 ^ 2 in the definition reflects the true ^-scaling of the fermion zero 
modes, as will emerge in due course.) One difference from the gauge zero modes follows from the fact 
that A is a Grassmann quantity; hence, M.\i and are constant (N + 2k) x k and k x (N + 2k) 
matrices of Grassmann collective coordinates, respectively, which replace the c-number quantities 
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and Ca in (2.107). In addition, as indicated, the Grassmann collective coordinates do not carry the 
Weyl spinor index a. 



In order for (4.29) to be a solution of (4.24a), the Grassmann collective coordinates must satisfy the 
constraints [18] (C.14) 

A & M + MA & = . (4.30) 
Expanding A and A as in (2.45) and writing all the indices explicitly, this becomes 

M*a Xj6l = -a^M Xj , (4.31a) 
M^b% = bfM Xj . (4.31b) 

In a formal sense discussed in Ref. [20], these fermionic constraints are the "spin-|" superpartners of 
the original "spin-1" ADHM constraints (2.54a) and (2.54b), respectively. Note further that (4.31b) is 
easily solved when b is in the canonical form (2.57). With the ADHM index decomposition A = u + ia, 

we set 

M Xj = M {u+ia)j = ( ( ^ } J , M) = M Ku+ta) = {fl ju (M ,a )ji) ■ (4-32) 
Eq. (4.31b) then collapses to 

M' a = M' a (4.33) 

which allows us to eliminate J\4' in favor of M'. In terms of the variables fj,, Ji and A4' a , the "fermionic 
ADHM constraints" (4.31a) are 

Maa + aaM = ftWa + + [M' a , a' a6l ] = . (4.34) 

The quantities Ai = {fi, p>, Ai' a }, subject to (4.34), are the Grassmann-valued partners of the ADHM 
variables = {wa,a' n }. Later in §IV.5 we shall see that they are, indeed, related by supersymmetry. 

We can now count the number of independent fermion zero modes. There are 2k(N + k) indepen- 
dent Grassmann variables M. = {//,//, M 1 }, subject to 2k 2 constraints (4.34). Hence, there are 2kN 
independent zero modes, in agreement with the Index Theorem. 



IV. 2. 2 Grassmann collective coordinates and the hyper-Kahler quotient construction 



It is interesting to establish the relationship between these Grassmann collective coordinates and the 
hyper-Kahler quotient construction. A hyper-Kahler space of dimension 4n admits a preferred SU(2) x 
Sp(n) basis of tangent vectors. In particular, this leads to the notion of a symplectic tangent vector 
(B.30). With the relation between the ADHM data a„ and the symplectic variables z ia in (2.68), we 
can see that the fermionic ADHM constraints (4.34) are precisely the condition that the symplectic 
tangent vector to the mother space 

/ Uiu \ 



M 1 



(4.35) 
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is a symplectic tangent vector to the hyper-Kahler quotient space 9%. Here, i is the an index that, as 
earlier in §11.4.1, runs over {iu,ij,ui,ij}. In order to prove this we must show that the inner-product 

M% rXl" = -AiAk T r (fiw« + w> + [a'« a , M' a }) , (4.36) 

vanishes, where X r are the vectors, defined in (2.69), that generate the U(fc) action on 9JT. It is easy to 
see that this condition is equivalent to the fermionic ADHM constraints (4.34). To summarize, we have 
shown that the Grassmann collective coordinates are Grassmann-valued symplectic tangent vectors to 
the instanton moduli space 971^. 

The functional inner product of the fermion zero modes can be calculated using the same integral 
formula that we used to establish the functional inner-product of the gauge zero modes in §11.5 and 
Appendix C (Eq. (C.20)). For two such zero modes 

J d 4 xtr N A(M)A(Af) = -ytr,^^ + + M{V^ + 1)M] . (4.37) 

Notice that the extra minus sign in (4.37) relative to (2.115) arises because of the Grassmann-valued 
nature of the collective coordinates. This is precisely the inner product of symplectic tangent vectors 
on 971: 

Q(M,AT) = -4 / d 4 xtr N A(M)A(AT) . (4.38) 

Just as in the c-number sector, where we have {X^} as intrinsic coordinates on 971^, so we can 
define intrinsic Grassmann-valued symplectic tangent vectors on 97tfc. To do this we solve the fermionic 
ADHM constraints Ai = X). Since the fermionic ADHM constraints are linear in the Grassmann 

variables, the parameterization Ai(ip,X) will be linear in the intrinsic Grassmann coordinates ip\ 
i = 1, . . . ,2kN. In much the same way that the metric on 97tfc is induced by that on 97T, so the inner 
product of Grassmann-valued symplectic tangent vectors on 97t, (4.38), then induces a similar inner 
product on 97tfc. So for a pair of symplectic tangent vectors ip and 9: 

ftyPO^P = &(M(il>,X),M(e,X)) . (4.39) 

Here, Qij(X) is the symplectic matrix which appears in the expression for the metric on DJlk in (B.9). 



IV. 2. 3 Supersymmetric and superconformal zero modes 

In §11.4.2, we described how those global symmetries of the classical equations-of-motion which act non- 
trivially on the instanton are represented on the moduli space. The symmetries described there — global 
gauge and conformal — will also have an action on the Grassmann collective coordinates. In addition, 
we have supersymmetry, enhanced to the superconformal group, that acts as symmetries of the classical 
equations-of-motion. These will act on the supersymmetrized moduli space [20,38,48]. 

A special set of the fermion zero modes can be identified with the action of supersymmetry trans- 
formations on the bosonic instanton solution. As is evident from (4.23b) and (4.23c), supersymmetry 
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transformations on the purely bosonic instanton turn on the fermionic fields: 

A A = io mn i A F mn , (4.40a) 

\ A = ia mn l A F m n ■ (4.40b) 

Actually, to be more precise, in the bosonic instanton background o mn F mn = 0, so the anti-chiral 
fermions are not turned on. In particular, this means that the bosonic instanton is invariant under 
half the supersymmetries, namely the anti-chiral ones £4. Correspondingly the chiral supersymmetry 
transformations generate fermion zero modes. Using the expression for the field strength and fermion 
zero modes in the ADHM instanton background, Eqs. (2.52) and (4.29), respectively, we find 

X A = ^g- 1/2 {a mn i A ) a Uba mn bfU 

= -Aig-'tjfb^fK - b a fi A b)U (4.41) 

= g-^Ki-Aibe) ■ 

Here, we have re-scaled 

i A -> g^i A (4.42) 

so that the following equations do not have g dependence. Consequently, the chiral supersymmetry 
transformations generate fermionic zero modes with the Grassmann collective coordinates 

M A = -4i£% , M\ A = -AtC A b x M . (4.43) 

These privileged fermion zero modes are known as the "supersymmetric zero modes" . There are obvi- 
ously two such modes for each species of fermion, hence the total number of supersymmetric modes is 
equal to half the number of component supercharges of the gauge theory, i.e. 2J\f. 

In addition to these supersymmetric zero modes, there are fermionic zero modes corresponding to 
broken superconformal invariance. These transformations can be obtained by generalizing the super- 
symmetry transformations (4.23a)-(4.23d) by making the parameters £ A local in a particular way: 

= it ~ *^t A , llix) = a + V*AX aa . (4.44) 

This defines a basis of both supersymmetric and superconformal transformations generated by f A } 
and {r) A ,fjA}, respectively. The bosonic instanton breaks half the superconformal transformations, 
namely those generated by fj A . These transformations generate fermion zero modes in much the same 
way as (4.41): 

X A = -4ig- 1/2 (a mn xfj A ) a Uba mn bfU 
= 4ig' 1/2 U(bxf] A fb a - bJf] A xPjU 

= -Aig-^ufa^fba - b a ffi A a)u 
= g-^K{-^ A ) . 

where in the second to last equality, we used the ADHM relations (2.47) which imply Ubx = —\Ja and 
xbU = —all. Consequently, the associated Grassmann collective coordinates are 

M M = -±ia Xi at , M$ = -Ai^af . (4.46) 

As for the supersymmetric modes, there are two such modes for each species of fermion, hence the total 
number of "superconformal modes" is equal again to 2J\f. 



(4.45) 
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IV. 3 Going beyond linear order: the quasi-instanton 



In order to evaluate correlation functions at their first non-vanishing order in the semi-classical ex- 
pansion, we will find it necessary to go beyond the linear order analysis of the previous subsection. 
The systematics of the semi-classical expansion can be deduced from the equations-of-motion (4.22a)- 
(4.22d) and the fact that the fermion zero modes \ A are order 0(g~ l l 2 ) and the gauge field is 0(g^ 1 ). 
Schematically, the various fields have the following ^-expansions: 

A m = g- 1 ^ + g A% + g s A% + • • • , 

X A = g -l/2 X ( 0) A + g 3/2 xW A + g 7/2 x{2) A + . . . ? 

Aa = g l/2 \ A * + / /2 A« + g^XT + • • • , ^ 
a = 9°4>f + 9 2 € ] + 9^ + ■■■ ■ 

In the above, 

A™ = Ud m U , A (())A = A(M A ) . (4.48) 
Note in the M = 1 theory there are no scalar fields, hence the leading order instanton solution, 

A rn = g- l A% , \ A = g- l ' 2 \^ A , \ A = , (4.49) 

is exact. We will also see shortly that in the absence of scalar VEVs the leading-order instanton 
configuration of the Af = 2 theory, 

A m = g~ l A^ , \ A = g- 1 ' 2 ^ , \ A = , <j> a = g^f , (4.50) 

is again an exact solution. This will no longer be the case when VEVs are turned on. The M = 4 case 
is more subtle, and we will treat it carefully. 

When the expansions are substituting into the action, the latter is 

„ Sn 2 k 



where we have defined 



+ lke + Sg + O(g 2 ) , (4.51) 

g 2 

S = J d 4 xtr N {v n <t>fV n <t>f - \^ aAB [^\ \^ B }) , (4.52) 

which will play an important role in what follows. So if we wish to work to leading order in g, we only 
need to solve the equations-of-motion to order g 23 From (4.51), it follows we need the expression for 
the scalar field to order g°. This is given by the solution of equation-of-motion (4.22d) with only the 
source bi-linear in the fermion zero modes included: 

V 2 <p a = g^ aAB \ A \ B • (4.53) 



23 The higher order effects are, in any case, mixed non-trivially with the quantum effects from the fluctuations as we 
shall see in §V. The fluctuations actually also contribute determinant factors at O(g ). However, since our theory is 
supersymmetric this is just a constant. 
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Since X A is 0(g~ 1 / 2 ) the solution is O(g ). The solution to (4.53) is found in Appendix C (Eq. (C.25)) 
with arbitrary VEV. Setting the VEV to zero, the solution has the form [21,25] 

a = -\T laAB UM A fM B U + U f ° [A ™ ° [ ™ ) U , (4.54) 

\U[2fe]x[JV] <-Pa J- [2] x [2] / 

where the k x k matrices ip a are 

(p a = \t aAB L- 1 (M A M B ) . (4.55) 

For the case of M = 2, one can see that (4.54) implies that the holomorphic field <fi is non-trivial, while 
$ remains zero. In other words, we can see that the lack of a reality condition on the fermions means 
that the scalar field in the instanton background also violates its reality condition. The same is true in 
the N = 4 theory. However, this violation of the reality condition only occurs in terms are quadratic 
in the Grassmann collective coordinates and so will not affect the convergence of the integrals over 
the fluctuations of the scalar field (which will still satisfy the usual reality condition). The fact that 
the reality properties of fields are violated in the instanton background by polynomial factors in the 
Grassmann collective coordinates will be a constant feature of the supersymmetric instanton calculus. 

For the applications considered below to evaluating correlation functions at leading semi-classical 
order, it turns out that we need go no further in iterating the equations-of-motion. However, it is 
instructive to go one step further by solving for the anti-chiral fermions at order g 1 ^ 2 . At this order it 
appears from (4.22c) that the source term for the anti-chiral fermion turns on. However, for the M = 2 
theories there is a major simplification compared with the M = 4 theories. From (4.53) we can see that 
only the components of (f> a which are non-trivial are of the form <p a = (fi AB T> aAB . But we recall from 
(4.14) for the N = 2 theories 

^aAB = tAB^a , S a = 1) . (4.56) 

Consequently 

N = 2 : ^aAB^aCD = t AB t C D^a = (4.57) 

and so the source term for the anti-chiral fermions vanishes and the anti-chiral fermion remain zero. 
This should be contrasted with the situation in the M = 4 theories where one can show from the 
definition (4.15) 

M = 4 : ZaAB^aCD = 2e ABCD ■ (4.58) 
Hence, in these theories the source for is non-vanishing and the anti-chiral fermions are non-trivial. 

Another way to see the difference between the Af — 2 and M = 4 theories is to note that the former 
has an abelian factor in their i?-symmetry group under which the fields have the following charges 

q(A n )=0, g(A) = l, <Z(A) = -1, g(0) = 2, q(tf) = -2 . (4.59) 

In the instanton background, the charge 2 component is non-trivial, however, it is <ft, the charge —2 
component, that couples in the source for A. In the H — 4 case there is no abelian i?-symmetry to pro- 
vide a similar selection rule of this kind. The existence of the abelian i?-symmetry in the M = 2 theory 
means that no other sources are turned on and so there is fully-fledged supersymmetric generalization of 
the bosonic instanton solution which depends on 4kN Grassmann collective coordinates. The solution 
has A n , \ A and <p a all non-trivial, where A n is the original gauge field, \ A are the zero modes of the 
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Weyl equation and 4> a is the solution (4.54). It is easy to verify that this "supersymmetric instanton" 
is degenerate with the original bosonic instanton and so has action —2-kit. In this case, therefore, the 
instanton effective action S in (4.51) vanishes, as do all terms higher order in g. 

The picture in the M = 4 theory is much more subtle. Naively one would think that a supersym- 
metric instanton with 8kN Grassmann collective coordinates exists also in this case, albeit that it is 
much harder to find. We will argue that this is not the case. Remarkably, we will find that most of 
the Grassmann collective coordinates, exactly the 8k N — 16 of them, are not true collective coordi- 
nates in the sense of parameterizing a space of solutions of the classical equations-of-motion. In view 
of this, we shall call them quasi- collective coordinates and associate to them a quasi-instanton. They 
correspond to zero modes of the linearized system that are lifted by interactions. As a consequence, 
the action of the theory evaluated on the instanton solution (4.51), actually depends non-trivially on 
the quasi-collective coordinate modes. The only "genuine" collective coordinates are those protected 
by symmetries of the theory. For the M = 4 theory these number sixteen in total, corresponding to the 
supersymmetry and superconformal generators which, as explained in §IV.2.3, act non-trivially on the 
instanton. These zero modes are protected by symmetries and cannot be lifted by interactions. Hence 
the exact instanton solution of the equations-of-motion of the M = 4 theory contains only sixteen Grass- 
mann collective coordinates. The expressions for the multiplet of fields of this solution can be found 
by acting on the bosonic instanton with a series of supersymmetry transformations (4.23a)-(4.23d), 
with the x-dependent Grassmann parameters £ A in (4.44) and with the re-scaling (4.42). Iterating this 
"sweeping-out" procedure to fourth order yields 

A i(j ^ 0~mn£, {x)F mn i 

(p a = -^gt, aAB ^ A (x)a mn ^ B (x)F mn , 
Xa = 1 p9 3/2 e A BCD(fF mn ^ B (x)^ c (x)a mn e(x)} , 
A m = A m - \g 2 e AB c D [i A {x)a mn i B {x)\ [^ C (x)a k ^ D (x)]V n F kl . 

However, this exact instanton solution is not the most convenient starting point for the semi-classical 
approximation to the functional integral. Integrating over the quadratic fluctuations in the background 
of this exact solution, one would have to introduce the additional 8kN — 16 fermion zero modes of the 
Dirac operator. These modes will couple to the scalar field fluctuation and in order to lift them one 
would have to re-sum tree-level perturbative contributions in the instanton background. A much faster 
and more elegant way of addressing this problem is to modify the instanton background by including in 
it from the beginning all 8kN fermion zero modes. Thus, as in Ref. [21], we will always work with the 
quasi-instanton configuration in the Af = 4 theory which is not an exact solution, but does not require 
perturbation theory to lift the quasi-zero modes. 

The subtlety of the quasi-instanton of the M = 4 theory appears when we try to solve for the anti- 
chiral fermions (4.22c). In Appendix C (Eq. (C.34)), we show that the right-hand side of the (4.22c) 
can be decomposed as 

(?S aAB [0 a , \ B ] = g 1,2 {lp4>A + A (A/a)) . (4.61) 

Here, Ma is a Grassmann odd function of the collective coordinates which satisfies the fermionic ADHM 
constraints (4.31a) and (4.31b) whose form is written down in Appendix C, Eq. (C.44). 24 This means 
24 Actually the Appendix considers the more general case when the scalar fields have VEVs. 
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that the second term in (4.61) is a zero mode of f>. The expression for ip A in (4.61) is also given 
in Appendix C, Eq. (C.38). Notice that the right-hand side of (4.61) has a component in the kernel 
of lp. However this kernel is not in the image of Tp, in other words there is no A^ such that Tp\ A — 
g 1 / 2 (lpijjA + EU). Hence it is impossible to solve this equation-of- motion. The best that can be done is 
to set 

Aa = g l/2 i> A , (4.62) 

which gives the leading order 0(g 1 ^ 2 ) expression for the anti-chiral fermion in the instanton background. 
In particular, from the expression for tp A (C.38), this solution contains a cubic dependence on the 
Grassmann collective coordinates. 

We shall see in §V, the failure to solve the anti-chiral fermion equation-of-motion is a symptom of 
the fact that, all but the sixteen Grassmann collective coordinates associated to broken supersymmetric 
and superconformal invariance, are lifted by interactions and so are only guasz-collective coordinates. 
The leading order effect is captured by the order g° term in the action (4.52). Even though an exact 
supersymmetric instanton does not exist in the M = 4 case, dependent on all AkN Grassmann collective 
coordinates, the approximate solution {A m , \ A , <p a }, the "quasi-instanton" , is sufficient to capture semi- 
classical contributions to the leading order in g. The expression for the anti-chiral fermion is then only 
needed when one considers correlations functions with explicit insertions of A^. 



IV. 4 Scalar VEVs and constrained instantons 



In this section, we examine instantons in cases where the scalar fields have non-vanishing vacuum 
expectation values. From the outset, we emphasized that instantons are a semi-classical phenomenon 
and therefore are only expected to describe the physics of these theories in a weakly-coupled phase. 
The Af = 4 theory has a weakly coupled regime, obtained simply by taking g — which does not run — to 
be arbitrarily small. In this case the theory is in a non-abelian Coulomb phase and the semi-classical 
approximation is reliable. However, the pure Af = 1 and Af = 2 theories (with vanishing VEVs in the 
latter) are strongly interacting. These theories can be rendered weakly coupled by breaking the gauge 
symmetry via the Higgs mechanism. Either the gauge symmetry is broken to an abelian subgroup, 
yielding a Coulomb phase, or it is broken completely, yielding a Higgs phase. In the Af = 1 theory this 
can only be achieved by adding matter fields, a subject that we will pursue in §VI.3 and §VII. The 
Af = 2, like the Af = 4, theory has adjoint- valued scalar fields, 4> a , which can develop a VEV driving the 
theory into an abelian Coulomb phase. By taking the VEV to be large the theory is weakly coupled and 
semi-classical methods can be rigorously justified. As mentioned in the introduction to this chapter, 
scalar VEVs have an unfortunate side effect on instantons: strictly speaking they no longer exist! The 
way to resurrect them and make sense of the theory in the Coulomb or Higgs phase was worked out 
some time ago by Affleck [49] (see also [50]). Instantons are replaced by "constrained instantons" in 
a rather technically demanding formalism. But it turns out that the constrained instanton formalism 
is a paper tiger: working to lowest order in the semi-classical expansion with constrained instantons 
involves only a relatively mild generalization of the instanton calculus and, moreover, one which has a 
nice geometrical interpretation in the moduli space picture. Roughly speaking, instantons are no longer 
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true minima of the action and a potential develops on 9JT fc : instantons now have non-trivial action 
as well as entropy. In this sense constrained instanton are examples of the more general notion of a 
quasi-instanton that we have already seen in the J\f = 4 theory. 



IV. 4.1 Constrained instantons on the Coulomb branch 

The M = 2 and M = 4 theories contain two and six real adjoint- valued scalar fields, respectively. 
The classical potentials of these theories have flat directions along which these scalars develop VEVs 
breaking the gauge group to its maximal abelian subgroup by the adjoint Higgs mechanism. In the 
case of gauge group SU(iV) the unbroken subgroup is U(l) iV ~ 1 . Without loss of generality, we can 
simultaneously diagonalize all the components of the VEVs and we label the elements 



w ith J2 u= i(<j>a)u = for tracelessness. Here, and in the following, the superscript "0" on a scalar field 
denotes a vacuum expectation value. Up to Weyl transformations, these VEVs parameterize a moduli 
space of inequivalent vacua known as the Coulomb branch. 

After the Higgs mechanism has done its work, the diagonal components of all the fields remain 
massless whereas the off-diagonal components gain masses | — {4>a)v\- The M = 2 theory is 
asymptotically free with dynamical scale A. The running of the coupling is cut-off in the IR at a 
scale set by the masses of the off-diagonal components. Thus, as long as we choose VEVs such that 
|(0a)« — (0a)J ^ A, for all u, v, the theory is weakly coupled at all length scales and semi-classical 
(instanton) methods should be reliable. In contrast, the coupling constant of the J\f = 4 theory does 
not run and we may achieve weak coupling simply by setting g 2 <C 1, either on the Coulomb branch or 
at the conformal point where the non-abelian gauge symmetry is restored. 

As we mentioned in the introduction to this Chapter, in the background of scalar VEVs, instantons 
are no longer exact solutions of the equations-of-motion due to Derrick's Theorem [51]. The action can 
always be lowered by shrinking an instanton, so the the size of the configuration cannot be a genuine 
modulus when the VEVs are turned on. The way to implement the semi-classical approximation around 
instanton quasi-solutions in theories with symmetry breaking was developed by Affleck [49,50]. For 
simplicity, we review this approach in the context of a single BPST instanton of scale size p. The basic 
idea is to introduce a new operator, or "Affleck constraint", into the action by means of a Faddeev- 
Popov insertion of unity. If this operator is of suitably high dimension Derrick's theorem is avoided and 
instantons stabilize at a fixed scale size p. The integration over the Faddeev-Popov Lagrange multiplier 
can then be traded off for the integration over p. The now-stable solutions are known as constrained 
instantons. Of course, the detailed shape of the constrained instanton depends in a complicated way 
on one's choice of constraint, but certain important features remain constraint independent, namely: 

(i) The short-distance regime, x <C l/(g<f) ). 25 In this regime the equations-of-motion can be solved 
perturbatively in g 2 p 2 (f>°; since ultimately, as seen ex post facto in §V2, the integration over scale size 
25 Hcrc, 4> a is the characteristic scale of the VEVs. 




(4.63) 
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is dominated by p < 1/0° this is tantamount to perturbation theory in g. As the constraints do not 
enter into these equations until some high order, the first few terms in this expansion are robust. In 
particular, to leading order in the semi-classical approximation, the gauge fields and fermions are equal 
to their BPST expressions while the scalar fields undergo a minor modification to take account of the 
VEV. 

(ii) The long-distance regime, x 3> l/(g<f>°). The long-distance "tail" of the instanton reflects the 
Higgs mechanism. In the model at hand, the instanton component fields which gain a mass via the 
Higgs mechanism decay exponentially. In contrast, the diagonal components of the fields fall off as 
powers of p 2 /x 2 . It is an important assumption of the constrained instanton method that to leading 
order in the semi-classical approximation, the long-range behaviour of these massless fields is simply an 
extrapolation of the BPST core. 26 So for the massless components the BPST form of the solution is all 
that one needs to discuss the instanton on all length scales (at leading order). However, even for the 
massive fields the BPST form often suffices, even though it is not correct at large distance. The reason 
is that the error made in using the BPST form rather than the actual exponential fall off is higher order 
in the coupling. 

The most important conclusion of the constrained instanton method is that to leading order, small 
constrained instantons are well approximated by ordinary BPST instantons. But since these are the 
ones that are favoured by the now size- sensitive action of the constrained instanton, little error is made 
by replacing the constrained instanton by a conventional instanton. We will apply the same reasoning 
to the case of arbitrary topological charge to find that the core of the required constrained instanton is 
more or less the ADHM instanton that we have constructed: the gauge field assumes its ADHM form, 
(2.49), and the fermions \ A are the linear combination of zero modes of lp, (4.29). The scalar field 
continues to obey (4.53) but now the boundary condition on a is that it must approach the VEV 0° 
at large distance from the instanton. The general solution with VEV [25] is derived in Appendix C 
(Eq. (C.25)) 

a = -\^ aA BUM A fM B U + U )u, (4.64) 

\ u (f a 1 [ 2 ] x [2] / 

with 

Va = L 1 ^ aAB M A M B + ™ d 0> d ) . (4.65) 

That this has the requisite boundary condition (4.63) can be verified using the asymptotic formulae in 
§11.4.3. 

As long as we work to leading order in g, the Affleck constraint does not explicitly appear and 
furthermore there is no need to iterate the instanton solution any further. Just as in the Af = 4 theory 
with zero VEV, the quasi-instanton solution {A m , \ A , a } is sufficient to capture the leading order 
semi-classical approximation of the functional integral, as we describe in §V. 

26 In principle, this simple patching of the short and long distance behaviour is modified at higher order in a way that 
is dependent on the precise form of the Affleck constraint. 
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IV. 5 Collective coordinate supersymmetry 

In §IV.2.3, we saw that chiral supersymmetry transformations on the bosonic instanton generate fermion 
zero modes: these are the supersymmetries that are broken by the purely bosonic instanton. Once, the 
fermion zero modes are turned on, the other half of the supersymmetry generators, the anti-chiral 
ones which left unbroken the bosonic instanton, now act non-trivially on the super-instanton. It turns 
out, as we uncover in this section, that the supersymmetry transformations on the super-instanton 
can be traded for supersymmetry transformations on the collective coordinates themselves. In fact 
the unbroken supersymmetries of the ADHM background are linearly realized on the bosonic and 
fermionic collective coordinates. This is an example of a general feature of BPS saturated solitons and 
corresponding higher- dimensional extended objects (i.e. branes). In all of these cases the unbroken 
supersymmetries are linearly realized in the world-volume theory. Invariance under these symmetries 
is an important constraint which must be satisfied by the super-instanton measure we will construct in 
the next section. 

To find the transformations, we need to consider the supersymmetry variation of the gauge field 
A, m and fermions \ A in the background of the non-linear supersymmetric instanton solution. By the 
latter, we mean the solution with A m and \ A equal to their ADHM forms, (2.49) and (4.29), and 
with the scalar fields given by (4.64). In the M = 4 case, or on the Coulomb branch of the M = 2 
theory, this quasi-instanton configuration, as we have already described in §IV.3 and §IV.4, is only 
an approximate solution to the equations-of-motion, but, nevertheless, provides a convenient way of 
capturing the leading order semi-classical contribution to the functional integral. As a symptom of the 
non-exactness of the instanton solution in these cases, supersymmetry transformations on the fields, as 
well as transforming the collective coordinates, also turn on components of the fields at a higher order 
in the semi-classical expansion. We have already seen this phenomena as the "sweeping-out" procedure 
that led to the expressions for a , and A m in (4.60). 

We start by considering the supersymmetry variation of the gauge field (4.23a). Since the anti-chiral 
fermions vanish, we have 

6 A* = 2ig~ 1 /% A X A = 2g~ 1 A a (i^ A M A ) . (4.66) 
Here, for convenience we have re-scaled 

U - g~ 1/2 U . (4.67) 

Comparing this with (2.110), the variation of the gauge field up to a local gauge transformation, under 
a variation of the c-number collective coordinates, we deduce the simple rule 

Sa A = i£aAM A , 5a« = iC A M A . (4.68) 

This leaves the transformations of the Grassmann collective coordinates which are deduced by con- 
sidering the variation of the fermions (4.23b). We have already shown how the first term lifts to the 
collective coordinates in §IV.2.3 but for consistency and contrary to (4.69), we should re-scale 

e - g i/2 e . (4.69) 
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The second term involves an expression which is higher-order in the VEVs and Grassmann collective 
coordinates of 0(g 3 ^ 2 ). In other words this term does not contribute to 5\ wA but rather the next term 
fi\w A i n th expansion (4.47). This is example of the sweeping-out procedure turning on a higher-order 
term in the semi-classical expansion of a field. Note with vanishing VEVs and M < 4 supersymmetry 
this term vanishes. The final term is 0(g~ l l 2 ) and in Appendix C (Eq. (C.45)), we show how it lifts 
to a variation of the Grassmann collective coordinates. Putting the two variations, from the first and 
third terms, together, we have 

5M A = -Aii A b a + 2*£f B C a d% , SM A = -AtC A K + 2iE£ B Z &B C2 , (4.70) 

where 

CaA = ( o° v0 a " " a ^ a ' C: = ^ ( o° ^°„) " ^ ' (471) 

where ip a was defined in (4.65) in the context of the scalar field equation. Notice that with the re-scaling 
(4.67) and (4.69), all ^-dependence drops out of (4.70). 

The anti-chiral supersymmetry transformation also turn on the anti-chiral fermions at 0(g 1 ^ 2 ): 

5\ A = -ig 1,2 ^ abA B lB^a, H ■ (4.72) 

Again, this is a symptom of the fact that the quasi-instanton is not an exact solution to the equation- 
of- motion. 27 Nevertheless, the supersymmetry transformations on the collective coordinates that we 
have derived in (4.68) and (4.70) will turn out to be symmetries of the leading-order semi-classical 
approximation of the functional integral. 



Except for TV < 4, with zero VEVs, in which case (4.72) vanishes. 
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V The Supersymmetric Collective Coordinate Integral 



This Chapter is a companion to the previous one and in it we consider how the supersymmetric instan- 
ton is used to implement the semi-classical approximation of the functional integral in the context of a 
supersymmetric gauge theory. As we have already seen, there are extra subtleties in the supersymmetric 
case arising from the existence of quasi-zero modes. In this Chapter we show how the procedure that 
we adopted to solve the equations-of-motion is precisely in accord with an expansion in g 2 . First of all, 
in §V.l we describe how the semi-classical approximation of the functional integral leads to a supersym- 
metrized version of the collective coordinate integral in which the Grassmann collective coordinates are 
also integrated over. We show that, even though the non-trivial parts of the fluctuation determinants 
cancel between bosonic and fermionic fluctuations, there is still, in general, a non-trivial integrand 
which arises as a consequence of quasi-zero modes. In fact the integrand involves the exponential of 
the "instanton effective action" , a suitably supersymmetrized potential on the instanton moduli space. 
In the case of constrained instantons the instanton effective action penalizes large instantons but, even 
when the VEVs vanish it is a non-trivial function of the Grassmann collective coordinates in the M = 4 
theory. The instanton effective action is constructed and analysed in §V.2. In §V.3 we find an expres- 
sion for the supersymmetric volume form on the instanton moduli space. As in the pure gauge theory, 
the hyper-Kahler quotient construction plays a central role here. Finally, in §V.3.1, we show how the 
supersymmetry of the underlying field theory is manifested on the collective coordinates. 



V.l The supersymmetric collective coordinate measure 



In order to implement the semi-classical approximation we must expand around the bosonic instanton 
solution. In this section, we will initially consider the case where the VEVs vanish, so avoiding the 
complications of the constrained instanton. The gauge field is expanded as in (3.1) and (3.5) where we 
recall that the fluctuations are split into the zero mode piece and the component orthogonal to the zero 
modes; the latter being denoted A n . The fermions are treated in a similar way: we separate out the 
zero modes of lp and write the chiral fermion as 

\ A ( X ) = g- l / 2 \^ A ( X] X, ij A ) + \ A (x- X, V) • (5.1) 

The 2kN Grassmann collective coordinates, for each species, are denoted by ip A . Since the anti-chiral 
fermions and scalar field have no zero modes in an instanton background we continue to denote them 
as \a and <j) a . 



The functional integral over the fermions can be factorized into integrals over the Grassmann col- 
lective coordinates ip A and the non-zero mode fluctuations; schematically 28 

r N r M ( y-r2kN j I LA -) 

/nw^ w /n {^kwm ^ ^} ■ < 5 - 2 > 

28 The factor of g kNJ ^ arises from the factor of g~ Y l 2 included in the definition of each fermion zero modes in (4.29). 



65 



Here, Q(X) is the anti-symmetric 2kN x 2kN matrix defined by the functional inner-product of the 
zero-modes (4.39): 

J d^xtiN \ m (x; X^ A )\ m (x; X,*P B ) = -\5 AB ^{X)i> iA ^ B (5.3) 

and the Pfaffian ensures that the integral is invariant under re-parameterizations of the Grassmann 
collective coordinates. 

We now plug the expansion of the fields into the action of theory we obtain 

S[g- l A^ + A m , g- l / 2 \ {0)A + \ A , \ A , a ] = -2mkr + + S int . (5.4) 
Here, the second term denotes the kinetic terms for the non-zero mode fluctuations: 29 

Skin = j d A xti N { - \A &a A^jA p& - 2V n \ A u n \ A + V n (f> a V n (f) a } (5.5) 

and the third term includes the interactions between the zero and non-zero modes: 
Smt = j d*xtv N { - \^ aAB [<p a , \ mB ] - 2g 1 / 2 [A n , X A ]a n X (0)A - 2g 1 / 2 X^ aAB [<p a , \ B ] + • • ■ } . (5.6) 

The remaining terms, whose presence is indicated by the ellipsis are higher order in g. We have also 
not written down any terms involving the zero-mode piece of the gauge field because, as we saw in 
§111.1, once the integrals of the expansion parameters of the zero modes, the £ M in (3.5), are traded for 
integrals over the collective coordinates X**, at leading order the ^ are set to zero. 

We can now integrate over the non-zero modes {A m , \ A , \ A , a }, and ghosts. This defines a kind of 
effective action on the collective coordinates S e s, which we call the instanton effective action: 

e S cB def e2mk r J ^ [ rf6 ] ^] ^] ^] e -S kin -S int -5 gh _ ( 5 J ) 

The instanton effective action is only non-vanishing in the M = 4 theory (or more generally in the 
M = 1 and M = 2 theories with non-vanishing VEVs) when the supersymmetric instanton is not 
an exact solution to the equations of motion. We can think of (5.5) and (5.6) as specifying a set of 
Feynman rules which determines S e s perturbatively in g. Only the non-zero mode pieces of the fields 
actually propagate and, in particular, the zero modes of the fermions A (0)A are non-propagating and 
act as sources. When we integrate out the fluctuations the second and third terms in (5.6) will not 
contribute at the leading g° order since they are order g 1 ^ 2 ; only the first term which is linear in the 
fluctuation of the scalar field is relevant. We can include the leading order effect in an efficient way via 
a shift of the scalar field by 0^ o) , the solution to the equation 

V 2 ^ = Z aAB \ wA \ mB . (5.8) 

This equation is identical to one which we solved in §IV.3 and yields an expression for <p a which is bi- 
linear in the Grassmann collective coordinates. So when working to leading order in g, it is convenient 

29 In the following, as previously, all covariant derivatives are defined with respect to the bosonic instanton solution 
(2.49). 
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to think of the background configuration of the instanton as being the multiplet {A m , \ A , (f) a }, defined 
in (2.49), (4.29) and (4.64). Notice that as we explained in the introduction to Chapter IV, the solution 
for the scalar field has terms bilinear the Grassmann collective coordinates which violates the reality 
condition on the field: <p a is no longer the Hermitian conjugate of <f> a . However, as we explained, there 
is no inconsistency. 

Once we have shifted the scalar field, the fluctuations can be integrated out. At leading order this 
yields the usual determinant factors. The gauge field and ghosts have already been dealt with in §111.1, 
while for the fermions we obtain 

(° *) s id*- (° X) r = i det ' A<+> ■ detAw i i/4 • < 5 - 9 > 

for each flavour of fermion and where A^' are defined in (2.28a) and (2.28b). In (5.9), the prime, 
as usual, indicates that A^ + ^ has zero modes and the determinant must be taken over the subspace 
orthogonal to the zero mode space. Finally, the integrals over the scalar field fluctuation a simply 
gives an additional factor of [det(— V 2 )] 1 ^. Putting the determinant factors together we have 



det(-£> 2 ) 



det' A(+) 

using (2.29) valid in an instanton background. 



det'AW| Ar/4 .|detA^| Ar/4 .|det(-P 2 )| 1 - Ar = "'^ , (5.10) 



det A(-) 



There are two leading-order g° contributions to S e Q. The first contribution arises from evaluating 
the action of the theory on the multiplet {A m , \ A , <f) a }. This is simply the quantity S defined in (4.52). 
The second contribution comes from the determinants (5.10). Putting these together we have 

S cS = -2mkr + {£ + (AT/4 - l)log [det' /det A^]}g° + 0(g 2 ) (5.11) 

As we described in §111.3, the ratio of determinants in (5.11) is simply a power of the Pauli-Villars mass 
scale: 

det/A(+) _ (5 12) 

detA(-)"^ • (5 ' 12) 

This cancelling of the determinants, up to a constant factor, is perhaps the biggest simplification that 
occurs for instantons in a supersymmetric gauge theory [34]. Recall that S is an expression quartic 
in the Grassmann collective coordinates. In the M = 1 theory, there are no scalar fields, so this term 
cannot appear. In the M = 2 theory with zero VEVs, this term also vanishes because the multiplet 
{A m , \ A , <p a } is an exact solution to the equations-of-motion. With zero VEVs, it is only in the M = 4 
theory that this contribution to S is non-vanishing as we shall find by explicit evaluation in §V2. 

Including the bosonic parts of the functional integral as described in §111.1, the final expression for 
the leading-order semi-classical approximation of the functional integral in the charge-/c sector is 



So the final expression for the leading-order semi-classical approximation of the functional integral 
in the /c-instanton sector is simply a multiple of the supersymmetrized volume form on the instanton 



67 



/> 



moduli space 9J4 with an integrand involving the instanton effective action S: 

[dA] [d\] [d<j>] [db] [dc] e~ s ^ (V\ kN( ~ 4 - M \2*ikT I w (AO e -S > (5. 14 ) 

where we have defined an M supersymmetric volume form on DJlk'- 

I „ r Vdet g (X) jj X nil „ (5 15) 

Jm, J [Pfaffi!i(X)f i,\V2il\l\ 

We will refer to the quantity 

2#° = / u w e~ § , (5.16) 

as the instanton partition function since it has the form of a partition function of a zero dimensional field 
theory. Later in Chapter X we will see that for M > 1 it can be viewed as the dimensional reduction of 
the partition function of a higher-dimensional field theory. Specifically for M = 2, respectively M = 4, 
the field theory is a two-dimensional, respectively four-dimensional, a-model with 971^ as the target 
space. This point-of-view leads very naturally to the relation of the instanton calculus to the dynamics 
of D-branes in string theory described in §X.3. 

In the pure Af = 1 and M = 2 theories, the pre-factors of the collective coordinates measures (5.14) 
can be related to the renormalization group invariant A-parameters. The point is that the coupling g 
must run with the Pauli-Villars mass scale in such a way that the combination in front of the measure 
is a renormalization group invariant. This defines the A-parameters in the Pauli-Villars scheme: 

A^L = ^ N g^Y 2N e^ 2 l^ 2+w , A 2 JL 2 = ^ e ~^l^?^ . ( 5 . 17 ) 

(Notice that the powers of g in (5.14) and (5.17) do not match. The additional powers of g come from 
the insertions when one calculates correlations functions.) On the contrary, the J\f = 4 theory is finite, 
the coupling does not run and as a consequence the integration measure (5.14) is independent of the 
Pauli-Villars mass scale ji. 



V.2 The instanton effective action 



In this section, we evaluate the leading order contribution to the instanton effective action S in (4.52). 
When the scalar fields have a non-trivial VEV, the semi-classical approximation proceeds via Affleck's 
constraint method as explained in §IV.4. In this case, the scale sizes of instantons cease to be true moduli 
and the instanton effective action will be a non-trivial function on 9Jl fc which breaks superconformal 
invariance. 30 However, the now gnasi-collective coordinates are still to be integrated over in the semi- 
classical approximation of the functional integral. Furthermore, the Affleck constraint, at leading order, 
does not explicitly appear and further discussion of it is unnecessary. To leading order, the net effect 
of introducing the VEV is to change the boundary condition on <fr a , as indicated in (4.64) and this will 
feed into the instanton effective action S in a way we now calculate. 

30 In the Af = 4 case, this is in addition to the non-trivial instanton effective action that is present even when the VEVs 
vanish. 
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With an integration by parts together with the scalar equation-of-motion, Eq. (4.52) may be re-cast 



as' 
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S = J d A X {<9„tr7V (0a^n0a) ~ \g^N X^aAB^a, >?] } 



(5.18) 



The first term, being a total derivative may be converted to a surface integral over the sphere at infinity 
in spacetime. Since it is gauge invariant we can evaluate it in any convenient gauge. In particular, in 
singular gauge defined in §11.4.3, Eqs. (2.105) and (4.64) imply, in the limit of large x, 



—^>n<Pa 

X 



-g QSoAB^V + W & W & <jP a + <jP a W & W & - 2w & <f a W & ) . 



X 



Hence, the first term in (5.18) is 



47r 2 tr fc 



(5.19) 



(5.20) 



The second term can be evaluated by using the identity (C.34) in Appendix C: 

V aAB [<t>a, A(Ai B )] = TpipA + A(AQ • (5.21) 

Then 

-\ J d 4 xtT N A(M A )(f^A + A(M A )) =-\J d ^ x (dntT N A(M A ) a n $ A + tr N A(M A )A(M A )) ■ (5.22) 

In the first term, we have used the fact f>\ A = to pull the derivative outside the trace. One can 
verify that for large x, ~ x~ 2 and A ~ x~ 3 ; hence, the first term on the right-hand side of (5.22) gives 
a vanishing contribution at infinity and may be dropped. The second term can be evaluated using the 
inner-product formula (4.37): 



(5.23) 



-\ J d 4 xtr N A(M A )A(M A ) = -^t Ik [M A {V 00 + l)M A + M A {V 00 + l)M A ] 

= 7T 2 S aAB tr fc [fX A <P° a fl B - M A M B p a ] , 

where we used the expression for Ma in (C.44) Appendix C: 

Ma = -^aA B { M B - M B Va } + 2 (° ^ a A - 1a & Ql ■ (5.24) 



Here, Q aA is a quantity needed to ensure that Ma satisfy the fermionic ADHM constraints (4.34), but 
which, as one can show by explicit substitution, does not contribute to (5.23). 

Assembling all the pieces, the leading-order contribution to the instanton effective action is 

S = 47T 2 tr fc {|S a ^ A 0°/i B + W*<l> J a W & - VaL^a) 

(5.25) 



47r 2 tr fc {iS aAB /i A 0°/i B + w & <t>° a <t>lw & 



(^ aAB M A M B + w & <f>° a w & } L 1 (\^ aCD M c M D + w^° a w $ ) } . 



31 



In the following {A mi \ A ,cf) a } take the ADHM expressions (2.49), (4.29) and (4.64). 
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One can verify explicitly that the expression above is invariant under the supersymmetry transforma- 
tions (4.68) and (4.70). Notice in the M = 2 theory when the VEVs vanish S = 0. However, in 
the M = 4 theory with vanishing VEVs, S remains a non-trivial expression quartic in the Grassmann 
collective coordinates: 

S = -^-e ABCD tT k (M A M B L- 1 M c M D ) . (5.26) 

This reflects the fact that in the Af = 4 theory the super-instanton is not an exact solution of the 
equat ions-of-mot ion . 



V.2.1 Geometric interpretation 



The terms in the instanton effective (5.25) have an elegant interpretation in terms of the geometry of 
DJlk- To start with, the quartic coupling is precisely the coupling of the Grassmann- valued symplectic 
tangent vectors M. A to the symplectic curvature of Tlk- 

S = ^e ABCD R(M A , M B , M c , M D ) = ^e ABCD Ri^ iA ^ B ^ C ^ D ■ (5.27) 

The second expression is written in terms of the intrinsic coordinates ip . Here, R is the symplectic 
curvature of the hyper-Kahler quotient space DJlk- It is a totally symmetric tensor in the Sp(n) indices. 
In Appendix B we derive a general formula (B.48) for the symplectic curvature of a hyper-Kahler 
quotient in terms of the curvature and connection of the mother space DJl. Using the relation (4.35) 
and the explicit representation of the generators of the U(fc) symmetry in Eq. (2.69), we find 

M U (&T r )..M ]B = -An 2 itT k T r (M A M B - M B M A ) . (5.28) 

Hence, using the formula for the symplectic curvature (B.48) (in terms of the ADHM variables M lA = 
M [ (ip A ,X)) and the definition of L rs in Eq. (2.124), we have 

^e ABCD Ri^ A ^ B ^ C ^ D 
yb 

= y^cd £ [M u (nT r ) ri M> B ] l; 1 [M^ c (nT s ) n M lD ] (5 . 29) 

rs 

= -Y^ABCDtr k (M A M B L- 1 M c M D ) . 
which is precisely (5.26). 

The pieces of the instanton effective action which do not depend on the Grassmann collective co- 
ordinates also have a very concrete geometrical interpretation in terms of the hyper-Kahler quotient 
construction. The first observation is that the SU(A r ) gauge symmetry of the theory acts as a group 
of isometries on the hyper-Kahler quotient space. Firstly, consider the SU(A^) isometries generated by 
the Killing vectors 

_. d d 

v = l ^ u d^ ~ lWutdt 7hii~ (5 ' 30) 

on the mother space 9JI. This group action descend to act as isometries on the quotient DJlk- To see 
this, it suffices to notice that the moment maps are invariant: V ji Xr = 0. In fact the group action 
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~ ( c ) 

is also holomorphic with respect to each of the three independent complex structures, Cyl = 0, a 
property that can be easily be shown to be inherited by the action on the quotient. Consequently the 
vector fields generated by the group action are tri-holomorphic. 

Now consider a set of such isometries associated to U(l) Ar_1 C U(iV) picked out by the VEVs of the 
scalar field: 

V a = iw^ u ((P a ) u —^ - i(<fa)uWuiaJT— ■ (5-31) 

This action, as described above is tri-holomorphic on the quotient space Wtk- The lifts of the Killing 
vectors on the quotient OJlk, V a , are equal to projection of the V a to "K C T%1. The bosonic parts of the 
action (5.25) are then equal to a sum over the inner-products of the vectors V a : 



4vr 2 tr fc 



\ 9ilv {X)V»V: , (5.32) 



where g is the metric on the quotient space. As explained in §11.4.1 (and more fully in Appendix B) 
the metric on the quotient space g(X, Y) is equal to g(X, Y), the metric on the mother space evaluated 
on the lifts of X and Y. Hence, 

g(V a , V a ) = g(V a , V a ) = g(V a , V a ) - ~g{V^ V a L ) , (5.33) 

where is the projection of V a to the vertical subspace V. The V^) 1 ^^ 1 group action on 971 is 
tri-holomorphic, hence g(V a , I^X r ) = 0, so that V a already lies in the tangent space of the level set 
T^fl C TWl. Since the vectors X r , r = 1, . . . , k 2 , form a basis for the vertical subspace, 

V a ± = J2x r L; s 1 ~g(X s ,V a ) (5.34) 

rs 

and so 

g(V a , V a ) = g(V a , Va) - X r )L^g(X 9 , V a ) . (5.35) 

rs 

The matrix of inner-products L rs is given in (2.124). By explicit calculation we have 

g(V a , V a ) = 8vr 2 tr fc (w*(j>° a (j> a w & ) , g(V a , X r ) = 8vr 2 tr fc (T r iD & (f> a w & ) . (5.36) 
Substituting these expressions in (5.35) gives (5.32). 

The remaining terms in (5.25) involving the Grassmann collective coordinates can also be given a 
geometric interpretation. First of all, we recall from §IV.2.2 that, from a geometric perspective, the 
Grassmann collective coordinates are Grassmann-valued symplectic tangent vectors to the instanton 
moduli space. The action of the U(l) Ar_1 symmetry on the Grassmann collective coordinates generated 
by the vector fields V a in (5.31) is simply 

8aV.ui = K<t>°a)*»ui, &aflt = wt($)u, 5 a M'£ a = , (5.37) 

which defines 5 a A4 A . This variation can be written in a completely geometric way using the (flat) 
connection on the mother space: 

S a M U = (%Vf)M> A . (5.38) 
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We now show that the Grassmann terms in the instanton effective action (5.25) can be interpreted in 
terms of the intrinsic geometry of 971^ as 

2^ aAB tv k [^y B - w^ a w^L' 1 M A M B ) = |S aAB fi ij (X)^ iA (V k(i K j ")^ kB • (5-39) 

This follows by proving 

^{X)^ A {V^V^ B = %M lA (V^(K - V^M™ (5.40) 

and then, from (2.68), (2.69), (4.35) and (5.37), 

f: aAB n rj M lA {V^vt)M^ B = -Sitt 2 W^W* , 
t aAB ^.M iA (V^V^)M* B = -%m 2 t aAB tr k w«<t>lw«L- 1 M A M B . 
Putting this together with (5.36), and using (2.124), we have proved (5.39). 

To summarize, the instanton effective action (5.25) can be written in an elegant way involving only 
the intrinsic geometry of 971^: 

S = \{g»„{X) V»V: + | Z aAB n i3 (XW A (V^Vr)^ B + ie ABCD RiM iA ^ kC ^ D } ■ (5-42) 

We will see in §X that this effective action can be obtained from a non-trivial dimensional reduction of 
a o"-model in two and six dimensions, for Af = 2 and M = 4, respectively, with the instanton moduli 
space SDTfc as target. 



V.2.2 The size of a constrained instanton 

Having derived the leading order expression for the instanton effective action, (5.25) or (5.42), we can, 
ex post facto, justify the main assumption of the constrained instanton method that there is an effective 
cut-off on large instanton sizes. First of all, since the bosonic part of the instanton effective action is 
simply (5.32), we can identify the minima of S as fixed points of the U(1) 7V_1 action on the instanton 
moduli space DJik- Since the construction of DJlk involves a quotient by U(fc), the fixed-point condition 
is 

{<f>a) u W uia = W uj a(Xa)ji , K, Xa] = , (5.43) 

in addition to the ADHM constraints (2.65). Here, Xa are kxk Hermitian matrices acting as infinitesimal 
compensating transformation in the auxiliary group U(fc) which can depend on the VEVs (f> Q a . For generic 
values of the VEVs the solution of the equations (5.43) are as follows. At least until we impose the 
ADHM constraints, each fixed-point set is associated to the partition 

k -> fci + k 2 + ■ ■ ■ + k N , (5.44) 

up to the U(fc) auxiliary symmetry. Each i e {1, 2, . . . , k} is then associated to a given u by a map Ui 
as follows: 



{ 1, 2, . , fci , fci + 1, . . . , fci + h , • • • , 

u=l u=2 

■ ■ ■ , h H h k u -i + 1, . . . , h H h k u , ki H h k N -i + 1, . . . , k \ 

s „ ' N ^ ') 



u u=N 
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For a given partition the variables have a block-diagonal form 

(Xa)ij = -{(t>° a )uAj , W ui6l OC 5 UUt , (d'Jij OC 5 UiUj . (5.46) 

Now we impose the ADHM constraints (2.65). In the u th block the constraints are of the form of a 
set of ADHM constraints for k u instantons for which N — 1. In fact the N blocks each correspond to 
the N U(l) subgroups of the gauge group picked out by the VEVs. 32 Taking the trace of the ADHM 
constraints within the block removes the a' n dependent terms to leave equations of the form 

i=k u -i+l 

However, the solution to these equations is w uia = 0. Therefore the structure of the partitions collapses 
to leave, up to the U(fc) symmetry, a' n and Xa diagonal: 

w uia = , a' n = -diag(X^, . . . , X k n ) . (5.48) 

Taking the solution above fixes all of the auxiliary symmetry apart from permutations; hence, the 
fixed-point space is simply Sym fc R 4 , the symmetric product of k points in R 4 . So the fixed-point set 
describes a configuration where all the instantons has shrunk down to zero size. It is easy to verify that 
the gauge potential on this singular subspace is pure gauge. As the sizes of the instantons grow, the 
effective action favours instantons with sizes up to a scale ~ 1/0°, where 0° is the characteristic scale 
of the VEVs, after which they are exponentially suppressed. Therefore, constrained instantons have a 
natural cut-off on the their scale size. 

V.2.3 The lifting of zero modes 

Of paramount importance for applications is the process of lifting fermion zero modes encoded in the 
instanton effective action. First of all, consider the case with vanishing VEVs. In this case, the only 
genuinely weakly-coupled scenario is the Af = 4 theory. As we have emphasized in §IV3, only the 8 
supersymmetric and 8 superconformal zero modes are exact, the remaining 8kN — 16 are lifted by in- 
teractions and consequently we expect the instanton effective action S to depend on all the Grassmann 
collective coordinates except (4.43) and (4.46). It is easily verified from (5.26) (using the fermionic 
ADHM constraints (4.34)) that the latter sixteen variables decouple from S as expected. There is an- 
other way to phrase this result. The decoupling of the supersymmetric and superconformal Grassmann 
collective coordinates implies that the symplectic curvature of dJlk admits four null eigenvectors: 

R iiM t = . (5.49) 

The fact that the symplectic curvature has four null eigenvectors implies that the holonomy group of 
97t fc is reduced from Sp(kN) to Sp(A;iV - 2) (or for Tt k from Sp(A;iV - 1) to Sp(A;iV - 2)) [28,52]. 

32 Actually this statement is not quite correct because we are taking a gauge group SU(iV). However, the ADHM 
construction actually yields a U(7V) gauge potential, where the abelian part of the gauge group is pure gauge: in this 
sense the statement is correct. 
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When VEVs are turned on, it is easy to see that the instanton effective action now lifts the super- 
conformal zero modes via the first term in (5.25), as one expects since conformal invariance is broken. 
Of course the supersymmetric zero modes remain unlifted because the introduction of VEVs does not 
break supersymmetry. 



V.3 The supersymmetric volume form on dJlk 

In this section, we show how to construct a volume form on the space of collective coordinates of a 
supersymmetric instanton, generalizing the result (3.17) in the pure gauge theory. We shall adopt an 
approach based on the hyper-Kahler quotient construction, but we shall find a result that is identical 
with the original approach of Refs. [20,30,31] which relied on various consistency conditions, principally 
supersymmetry and clustering, in order to construct the volume form. 

We have already seen how the Grassmann collective coordinates arise in the context of the hyper- 
Kahler quotient construction. The quantities {//, /i, Ai' a } can be arranged (4.35) as an n-vector Ai 1 
(n = 2k(N+k)). The fermionic ADHM constraints are precisely the conditions for Ai 1 to be a symplectic 
tangent vector to the hyper-Kahler quotient space. Using this fact it straightforward to write down 
the integration measure for the Grassmann collective coordinates. As for the volume form itself, the 
integration measure on the quotient VJlk is induced from that on the mother space DJl. The covariant 
expression for this latter quantity is 

, (5.50) 

Pfafm 

where Q-- is the anti-symmetric symplectic tensor on the mother space (2.68). We restrict to the 
quotient space by inserting explicit Grassmann-valued (^-functions to impose the symplectic tangent 
vector condition (B.30). These (^-functions must be accompanied by a suitable Jacobian giving 

/TT 2ra ^7A/<i 1 dimG 2 
-1 TT IT 8(M%X?) . (5.51) 
Pfafm <// -4 i 

r=l a=l 

The Jacobian Jf is related to the determinant of L, (2.124), the matrix of inner products of the normal 
vectors to the quotient space: 33 

Jf = |det fc2 L\ . (5.52) 

For the particular quotient that yields the ADHM construction DJI is flat, and Pfafff2 is simply a 
numerical factor and we can write the measure for each species of fermion as 

. k 2 2 

/ d 2k{N+k) M |det fc2 L| _1 Yl Yl 8(ti k T r (Ma 6l + a & M)) ■ (5.53) 

J r=l a=l 

where 34 

» » k' 2 k N 

/ d 2k ^M ^ / Hd 2 (M'Y IHldjkud^ . (5.54) 



r=l i=l u=l 



33 The following can be deduced from Eq. (5.79). 

34 0ur convention for integrating a two-component spinor ip a is / d 2 ip = J dtpi d\p2- 
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Putting this together with volume form for 9R k , (3.17), the collective coordinate volume form for an 
arbitrary supersymmetric theory is 



/ W W = / d ik ^a FT d 2k ^M A |det fc2 L 

Jm k VolU(fc)7 

fe 2 , 3 AT 2 

x II II K¥^T r {r^^a^)) J] JJ 5(tr fc T r (^V + a*A< A )) 



l-A/" 

(5.55) 



r=l ^ c=l A=l d=l 

The normalization factor can be determined by a careful analysis of the inner products of the zero 
modes: 

An independent check of the normalization constants Cj^ can be achieved, as in §111.2.1, by invoking 
the clustering property of the instanton integration measure. In order to apply this argument we have 
to consider how the Grassmann integrals behave in the complete clustering limit. In this limit, the 
off-diagonal components of the fermionic ADHM constraints are dominated by the term 

(X* - X j f a M' a + ■ ■ ■ . (5.57) 

We can therefore use the off-diagonal constraints to saturate the (M.' a )ij, i ^ j, integrals. This yields 
a factor of 

l[(X l - V f , (5.58) 

for each species of fermion. However, each species of fermion is accompanied by a factor of |detfc2_L| _1 
which clusters as (3.24). Hence, what remains in the complete clustering limit are the diagonal fermionic 
ADHM constraints which to leading order are the fermionic ADHM of the individual instantons. Hence, 
taking into account the clustering of the purely bosonic parts of the volume form, as in §111.2.1, we find 
that the supersymmetric measures clusters consistently. 



V.3.1 Supersymmetry 

In this section, we will verify that the volume form Eq. (5.55) is invariant under the supersymmetry 
transformations acting on the collective coordinates that we established in §IV.5. 

To begin with, consider the supersymmetry variations of the bosonic and fermionic ADHM con- 
straints, (2.65) and (4.34), since these appear as the argument of the (^-functions in (5.55). First, the 
c-number ADHM constraints (2.65): 

5(r^a%) = - i&M A a $ + t^ A a & M A ) = -ir^C A {M A a $ + a $ M A ) . (5.59) 

In other words, the bosonic ADHM constraints transform into a linear combination of the fermionic 
ADHM constraints (4.34). Now the fermionic ADHM constraints themselves: 

5{M A a h + a & M A ) = 4i£A(b a a & - a A b a ) + i£ &B (M A M B - M B M A ) + 2i^ B X AB (c£ a d - a«Cf) 
= iU(M A M B - M B M A - 2£fV>> Q + TL AB L^> a ) . 

(5.60) 
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To prove this, we used (2.54b) to show that first bracket on the right-hand side of (5.60) vanishes. 
Going from the first line to the second line involved using the bosonic ADHM constraints (2.65) along 
with the definition of L in Eq. (2.125). The final expression then vanishes by virtue of the definition of 
(p a in Eq. (4.65) along with a S-matrix identity (A. 20). 

The fact that the variation of the bosonic (fermionic) ADHM constraints involves the fermionic 
(bosonic) ADHM constraints looks promising because it means that the variation of the product of 
5-functions in (5.55) vanishes to linear order. For the M = 1 measure, this is sufficient to prove 
supersymmetric invariance. The reason is that the transformation of a„ involves M. but that of .M is a 
constant. Hence, the super- Jacobian for the transformation of {a^, M.} vanishes at linear order. Since 
there are no other contributions to consider, the Af = 1 measure is a supersymmetric invariant. 

However, the story is more involved with extended supersymmetry. The reason is that, contrary to 
the M = 1 case, the variation of the Grassmann collective coordinates (4.70) involves the Grassmann 
collective coordinates through the dependence of and on tp a , (4.71), which in turn depends on 
Ai A via (4.65). This means that at linear order there is non-trivial super- Jacobian for the transforma- 
tion of {da,Ai A }- In order to prove supersymmetric invariance, this super- Jacobian must cancel the 
transformation of the remaining factor [def^L] 1 ^ that we have not, hitherto, considered. 

Rather than evaluate the variations of these two quantities, and then show that they cancel, we will 
proceed in more indirect fashion. The idea is to remove the dependence of 5M A on the Grassmann 
collective coordinates by introducing some auxiliary variables. The obvious candidate is tp a itself. In 
order to implement this idea, we need to find out how tp a transforms. We can answer this by considering 
the variation of the definition (4.65). Writing this as 

L ■ (f a = \Y, aAB M A M B + w^° a w A , (5.61) 

the variation of the left-hand side is the sum of L ■ 5ip a and 

(SL) ■ Va = \ ^ ({p. A w & - w af i A + M' aA a' a6l + a' a& M' A , <p a } - 2M' aA Va a' a6l - 2a' a ^ a M' aA ) 

(5.62) 

= il\ (p A w&<p a + [M' aA a' a& , <p a ] - Va w^ A + [if a , , 

using the definition of L in (2.125). The last equality follows by using the fermionic ADHM constraints 
(4.34). For the variation of the right-hand side of (5.61), one finds 

\ ^ aBC ?P h A i%C^M B - M B C M ) + ^(// A frd - w^y A ) . (5.63) 

Taking the difference of (5.62) and (5.63), one deduces the variation of ip a : 

L-5 Va = i{^ aBC ^ A - 5 ab 5 A }e A F b l , (5.64) 

where we have defined 

= [C b «M B - M B C b& ) . (5.65) 

This looks disappointing because the variation of (p a seems to depend on itself via C b a and Cbd- However, 
notice that the variation of the Grassmann collective coordinates (4.70) actually depends not on ip a 
directly but, more precisely, on the combination T, AB (p a . From (5.64), we have 

l ■ = m - ^a c Ft + - ^rm . (5.66) 
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Now for Af = 2 supersymmetry, there is only a single independent quantity Y* AB ip a which we can take 
to be if = ^afa = ifi + V?2- in this case the right-hand side of (5.66) vanishes identically; hence 

Sip = . (5.67) 

We now consider the extended set of variables {a&, A4 A , </?}, where the latter is subject to its own 
"ADHM constraint" following from its definition (4.65): 

L-^ = -\(M 1 M 2 -M 2 M 1 )+w*<fwc l , (5.68) 

where 0° = S^ 2 0°. The Af = 2 measure (5.55) can then be written in a suggestive way by replacing 
the factor of |detfc2 with an integral over the auxiliary variable ip with an explicit ^-function which 
imposes the new ADHM constraint (5.68): 

jdet^Lp 1 = f d k \ Y[5(tr k T r (L ■ if + \M l M 2 - \M 2 M l - w & <jPw & )) • (5.69) 

J r =l 

It is then easy to see that, when re-written using the above identity, the new form of the measure is 
supersymmetric. This follows because the variations of {da,A4 A ,ip} are either off-diagonal or vanish; 
hence, the super- Jacobian vanishes to linear order. To complete the proof, the variations of the ADHM 
constraints, (2.65), (4.34) and (5.68) are also either off-diagonal or vanish, 35 and so the variation of the 
product of (^-functions also vanishes to linear order. 

For Af — 4 supersymmetry, we have to go back to (5.66) and follow the same logic. To make 
things simpler, and ultimately without loss-of-generality, we can focus on particular a supersymmetry 
variation, say In that case, the variation of the Grassmann collective coordinates (4.70) involves the 
three quantities y? A1 = SfV, A = 2,3,4. In that case, from (5.66), we have 

L ■ 5 V A1 = ig( - T}^t a + Sf ^ - Sf F\) = . (5.70) 

The new auxiliary variables (p A1 are subject to their own "ADHM constraints" : 

L-<p A1 = l (M 1 M A - M A M 1 ) + w d <#>d . (5.71) 

So for this particular supersymmetry variation, we consider the multiplet of variables {a„, A4 A , <p A1 }- 
The Af = 4 collective coordinate integral (5.55) can then be re-cast by introducing integrals over the 
auxiliary variables ip A1 along with explicit 5-functions to impose (5.71): 

|det fc2 L|~ 3 = ! \{{d k2 v A1 \{5{ti k T r {L- v A1 -\M l M A + \M A M l -T 1 A1 w^ (5.72) 

J A=2 ^ r=l > 

It is then straightforward to prove, following the same logic as for Af = 2, supersymmetric invariance. 
Firstly, the variations of {a„, M A , (p A1 } are either off-diagonal or vanish; hence, the super- Jacobian 
vanishes to linear order. In addition, the variations of the ADHM constraints, (2.65), (4.34) and (5.71) 
are also off-diagonal or zero. Consequently, the variation of the product of 5-functions also vanishes to 
linear order. The new twist in the Af — 4 case is, for each supersymmetry variation £4, we must use a 
different set of three auxiliary variables Y* AB (p a , B 7^ A to prove supersymmetric invariance. Invariance 
of the measure (5.55) under a general supersymmetry variation then follows by linearity. 
35 Note that the variation of (5.68) vanishes since we imposed it to find the variation of ip a above. 
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V.4 From J\f = 4 to J\f = via decoupling 



An interesting consistency check on our collective coordinate integrals, Eq. (5.14) along with (5.55), 
which relates the expressions for different numbers of supersymmetries follows from renormalization 
group decoupling. The idea is to take one of the supersymmetric theories and add mass terms for some 
of the fields in such a way that for large masses the massive fields can be "integrated out" and the theory 
flows in the infra-red to a theory with fewer supersymmetries. This procedure, when implemented at 
the level of the semi-classical approximation, provides a way to relate the instanton integration measure 
for different numbers of supersymmetries. 

We begin, with the Af = 2 theory, and give a mass to one of the two M = 1 chiral multiplets. 
Adding such mass terms is discussed in §VI.4. For a single flavour of fermion, the leading order effect, 
is to introduce (6.89) 

~ f 777 71 ^ — 171 ~ ~ 

S mass = m / d 4 xtr N \ 2 = tr fc + 1)M = -—Q~ 1 ~ ] M i M i , (5.73) 

J 9 ' Ag s 

into the instanton effective action, where the inner product of two Grassmann symplectic tangent vectors 
was defined in (4.38). 

The integral over the Grassmann collective coordinates M. is written down in (5.53). It is convenient 
to re-write the argument of the fermionic ADHM constraints using (4.36) and then including the mass 
term, we have 

. k 2 2 

U = Idet^Lp 1 / d 2k ^M J] J] 5(^M%-Xl") exp (^^M'M^j . (5.74) 

J r=l d=l ^ 



In order to evaluate the integral it is useful to decompose 



(5.75) 



where (M. ± ) 1 ClrrXl a — for all r — 1, . . . , k and a = 1,2. The quantity M. 1 - is the projection of 
M. which does not appear in the arguments of the (^-functions and its integrals must be saturated 
by bringing down powers of the mass term. Hence with the decomposition (5.75), the integral (5.74) 
factorizes as 



J = |det fc 2L| 



det 2k 2^X^n ri X^ 



j/3 



1/2 



j d^M 1 - exp ^l .M - M ") . (5.76) 



The determinant in the denominator is the Jacobian for transforming to the variables a r a- The Grass- 
mann integrals can now be done to give 



det^Iy 



-i 



det^X^Xf 



1/2 



2mTT 2 \kN 



9 



(5.77) 
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The second determinant factor can be related to a more familiar quantity by noting the following. From 
Appendix B, we established that on the level set of the hyper-Kahler quotient, the U(/c)-vectors X r 

~ (c) 

satisfy g(X r , I X s ) = 0. In terms of components 

e^X^Qr.X^T^ = implies X^HlXi ' oc . (5.78) 
Hence, from the definition of L is (2.124) and the explicit relation g(X r ,X s ) = e^X^fi-jX]^, we have 

det 2k ^xl«n..Xy\ = |det fe2 ^|e.^QrjXf | 2 = |det fc2 L| 2 . (5.79) 
Consequently, the two determinants in (5.77) cancel to leave a purely numerical factor 

J=(*=L) . (5.80) 

Using this expression for the integral (5.74), we find that after decoupling the Af = 2 collective 
coordinate integral gives the Af = 1 collective coordinate integral with the following relation between 
the Pauli-Villars mass scales: 

/^L = « , (5-81) 

Alternatively, we can phrase the result in terms of the renormalization group A-parameters defined in 
(5.17): 

K M=1 =(g 2 m) N K N=2 . (5.82) 
Therefore, the decoupling limit is g 2 m — > oo and A^ =2 — > in such a way that A_^ =1 is fixed. 

Now we turn to the relation between the Af = 4 and Af = 2 theory. We start with the M = 4 
theory and add an Af = 2 preserving mass term for a pair of Af = 1 chiral multiplets. This results in a 
contribution (6.89) to the instanton effective action: 



m 



J d 4 xtr N (A 3 A 3 + A 4 A 4 ) = -— tr fc [M 3 ^ + 1)M 3 + M 4 ^ + 1)M 4 } (5.83) 



We can now integrate out the two flavours of Grassmann collective coordinates A4 3 and A4 4 as above 
to verify that the collective coordinate integrals are related if 



& = ™ 2N ■ (5-84) 

This means that 

A x=2 = m 2N e-^ 2 ^ e (5.85) 
and so the decoupling limit is m — > oo and g 2 — > 0, in such a way that A AA=2 is fixed. 

We remark that (5.82) and (5.85) are consistent with the standard prescriptions in the literature for 
the renormalization group matching of a low-energy and a high-energy theory [53,54]. The absence of 
numerical factors on the right-hand side of these relations reflects the absence of threshold corrections 
in the Pauli-Villars scheme. 
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Finally, we can take the J\f = 1 theory and decouple the gluino to flow at low energies to the non- 
supersymmetric theory. In this limit, the M = 1 collective coordinate integral gives the extrapolation 
of (5.55) to Af = with the relation 

4N N 3N /r or\ 

/V= = m /V=i • ( 5 - 86 ) 

However, in the non-supersymmetric theory there is also a non-trivial fluctuation determinant described 
in §111.3 which is not, of course, reproduced. 
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VI Generalizations and Miscellany 



In this section we describe some important generalizations of the instanton calculus and other results. 
In §VI.l, we describe how the ADHM constraints can be solved when N > 2k. This will be useful in 
applications where a large-iV limit is involved, as in §VII and §IX. Then, in §VI.2, we explain how 
to generalize the instanton calculus to theories with gauge groups Sp(iV) and SO(iV). This is useful 
in the application to M = 2 theories described in §VIII involving gauge group SU(2) because it turns 
out that the instanton calculus of Sp(l) (~ SU(2)) is actually more economical and convenient than 
the SU(2)-as-an-example-of-SU(A) formalism developed previously. Moving on, in §VI.3, we describe 
how to include matter fields in the calculus. This is important for the applications in §VII and §X.l. 
The question of how to modify the instanton calculus when fields have masses is considered in §VI.4. 
Finally in §VI.5 we consider the instanton partition function in more detail. In particular we show how 
it can be "linearized" by introducing various auxiliary variables including Lagrange multipliers for the 
bosonic and fermionic ADHM constraints. We also show how the integrals over the overall position 
coordinate and its superpartners can be separated out, leading to the notion of the centred instanton 
partition function. 



VI. 1 Solving the ADHM constraints for N > 2k 

It turns out that when N > 2k the ADHM constraints can be solved in a certain generic region of the 
moduli space by a change of variables [21]. The idea is to introduce the gauge invariant U(A)-invariant 
bi-linears 

0'"M;; (6-1) 
which can be incorporated into four k x k matrices 

W° = tr 2 W, W c = tr 2 (T c W) . (6.2) 

When N > 2k, these variables are independent and the change of variables from the 4kN real variables 
{wa,w a }, to the bi-linear variables {W°, W c } and the gauge orientation XL (defined in §11.4.2) is invert- 
ible, at least when the gauge orbit is generic as in Eq. (2.87). In changing variables, one must also, of 
course, specify the integration domain for the new variables. In particular, since the W variables are 
the inner products of vectors they will be constrained by various triangle inequalities. 36 

The reason why this change of variables is useful, is because the ADHM constraints (2.65) are linear 
in W c ; indeed they can be re-cast as 

W c = -a' m a' n tr 2 (r c a m a n ) . (6.3) 

The final description of the subspace of the moduli space on a generic orbit of the gauge group for 
N > 2k is then in terms of the five k x k Hermitian matrices, W° and a' n , in addition to XL. However, 

36 Fortunately, these technicalities will not be relevant in our applications at large- N. In this case, steepest-descent 
methods apply and the saddle-point values of the W's obtained in §IX and §VII satisfy all such triangle inequalities by 
inspection. In addition, we will show that on the saddle-points the k instantons inhabit k commuting SU(2) subgroups 
of the the gauge group. Hence, manifestly, they lie on a generic orbit of the gauge group (2.87). 
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as long as one is in a phase where the gauge symmetry is not broken then physical quantities are gauge 
invariant and one can readily integrate over the gauge orientation II. 



In order to exploit this linearizing change-of-variables, we must determine how the instanton inte- 
gration measure transforms in going from {wa,w a } to the bi-linears {W°, W c } in Eq. (6.2) and the 
gauge orientation U. In what follows it will be useful to think of w„ as an iV x K- dimensional matrix 
w, where K = 2k, and to introduce the composite index a = ia — 1, . . . , K, so that the elements of w 
are w ua . A suitable SU(iV) gauge transformation IX puts the matrix w into upper-triangular form, as in 
(2.83). The are complex except for the diagonal elements £ aa which we can choose to be real. While 
a priori the £ ab are only defined for 1 < a < b < K, it is convenient to extend them to \ <b < a < K 
as well, by defining £ ab = In terms of these extended variables, the gauge-invariant bi-linear W 

defined in (6.1) is the matrix 



W = w 



?21 <S22 



£22 • ' • £,K2 



Kl C.K2 





W 



\o ■■■ i KK ) 



(6.4) 



Note that there are as many real degrees-of-freedom in the {IVab} as in the {£ a ft}- From Eq. (6.4) it 
follows that 

K 2 

det K W= (J[U) ■ (6-5) 



a=l 



When calculating the Jacobian, it is useful to pass through an intermediate change-of-variables 
involving £ rather than the W. The Jacobian for this is 



j d K2 W = 2 K J d K \f{i 

a=l 



2K-2a+l 
aa 



(6.6) 



This can be proved by induction. For K = 1 one has simply Wn = £f x . From Eq. (6.4) we can also 
easily relate the Jacobian for K to that for K — 1; one finds 



d({W ab }) 



d({U}) 



dW KK ( |j 8W aK dW Ka \ d({W ab }) 



K d^ KK V OUk d^Ka > <9({£„ b }) 



K-l 



a=l 



d({U}) 



(6.7) 



K-l 



The result (6.6) follows by induction. 



Next, we calculate the Jacobian for the change of variables from w to {£,11}. To this end, define 
u a = U ua to be the a th column of U, a = 1, . . . , K. Since U C SU(iV) 



(6.i 



The K A^-vectors u a then provide the well-known parameterization of the coset (2.84) as a product 
of spheres [55]. To see this, u 1 is a unit vector in an N- dimensional complex space and consequently 
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parameterizes S 21 *' 1 . The second vector u 2 is also a unit vector, but one which is orthogonal to u 1 , and 
consequently parameterizes S 2N ~ 3 . Continuing this chain of argument, we see that the vectors {u a } 
parameterize the product of spheres 

SU(AQ Q 2N-1 v c 2iV-3 v . . . v 02AT-2X+1 



SU(iV- AT) 



x S 2JV ~ d x ••• x S 2N - 2K+L . (6.9) 



From (2.83) we can read off the expansion of the elements of w in terms of the elements of the 
vectors u a : 

a 

W ua = J2^aU b u . (6.10) 
6=1 

The Jacobian can be determined through the following iterative process. First we start with w u \ = £ii M l) 
whose measure can be written in polar coordinates as 

f n dw ul dw* ul = 2 N j e^' 1 dfn rf^-^i • (6.H) 

J u=l J 

Here d 2iV_1 f2i is the usual measure for the solid angles on S 21 *' 1 parameterized by u 1 . Continuing the 
process on the next vector w u2 = £i2 M i + £22^, we have 

f J] dw u2 dw* u2 = 2 N ~ 1 f di 12 dC l2 e 2 t 3 di 22 d 2N ~ 3 Q 2 . (6. 12) 

J u=l J 

In general 

/N . a-1 

II dw ua dw* ua = 2 N -°- 1 / { J] dt ba dC ba }a- 2a+1 dUd 2N - 2a+1 n a . (6.13) 
u=l J 6=1 

where Cl a is parameterized by u a . Hence 

J d 2KN w = &K-K { K- m j {f[^- 2a+1 dUd 2N - 2a+1 n a } { i[dUd€ ab } ■ (6.14) 

Using (6.5) and (6.6), we obtain 

J d 2KN w = 2 NK - K ^' 2 J \det K W\ N ~ K d K2 W { f[d 2N - 2a+1 Q a } . (6.15) 



a=l a<b 



K 



a=l 



Re-introducing k = K/2, we have 

j d 2kN wd 2kN w = A k j \det 2k W\ N ~ 2k d k2 W° J] d k2 W c d 4k ^U. (6.16) 

J J c=l,2,3 

The integral over the 2k x 2k matrix W has been written as four separate integrals over the k x k matrices 
W° and W c , defined in (6.2), with respect to the basis {T r }, defined in §111. 2. 37 For convenience we 
have defined a unit normalized measure on the coset space: 

f d ±HN-k) U dcf 1 f TT rf 2iV-2a + l^ (61?) 



37 This accounts for an additional factor of 2 2fe . 
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and this fixes the normalization constant to be 



2k n9bM-A.b 2 -i-b ObN-Ob 2 - 



r > 2kN~4k 2 +k„2kN~2k 2 +k 

A = 2 2kN-Ak 2 -k TT Vol 5 ,2 ( Ar - a )+ 1 = t _L . ( 6 .i 8 ) 

In the expression (3.17) for volume form on DJlk, the ^-functions imposing the ADHM constraints 
simply soak up the integrals over W c (giving rise to the numerical factor of 2 3fe2 from the |'s in the 
arguments of the (^-functions) to leave 

f u= 2 ^4^ I d 4k2 a'd k2 W d Ak ^U |det fc2 L\ \det 2k W\ N - 2k . (6.19) 
J m VolU(fc) J 



The simplifications for N > 2k also extend to the Grassmann sector. The trick is to find a change 
of variables in the Grassmann sector which mirrors that which we have just described for the c-number 
collective coordinates. To this end, let us identify the super-partners of the collective coordinates 
associated to global gauge transformations on the instanton solution. Infinitesimally, the latter are the 
subset of 5w which preserve the gauge invariant variables W, i.e., which satisfy 

KJw uj$ + 5w? u w uj$ = . (6.20) 

Under a supersymmetry transformation (4.68) one has 

Sw ui a = i^aA^ti , <K* = ■ (6-21) 

Inserting Eq. (6.21) into Eq. (6.20) produces the gauge-invariant conditions 

C ;,«•>;' + = o (6.22) 

or equivalently, 

</4 = and P-iw vj& = 0- (6.23) 
To satisfy these constraints, it is convenient to decompose fi A as follows: 

= « nMC A h + vt fit = ((a )y< + vf u , (6.24) 

where v A lies in the orthogonal subspace to w: 

= 0, Pf u w uj& = . (6.25) 

The superpartners of the bosonic coset coordinates IL are then precisely the variables {u A 1 u A }. 38 Notice 
in the case of a single instanton, the variables ( A = ( A are precisely the Grassmann collective coordinates 
associated to superconformal transformations. 



We now turn to the Grassmann part of the instanton measure (5.55). As explained above, the 
superpartners of the global gauge collective coordinates IX are the Grassmann variables {z/ A , u A } defined 

38 It is worth mentioning that although the coset coordinates correspond to bosonic zero modes which are generated by 
Lagrangian symmetries, this is not true of their Grassmann partners. 
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in Eqs. (6.24)-(6.25). Since these coordinates are orthogonal to the w and w vectors, respectively, it is 
easy to see from (4.31a) that they do not appear in the fermionic ADHM constraints. The Jacobian for 
the change of variables from the original Grassmann coordinates {/j, A ,fi A } to {( A , ( A , u A , z/ A }, is, for 
each value of A, 39 

V ; - det 2fc W^ . (6.26) 



d({C A ,( A ^ A ,u A }) 

As in the bosonic sector, the change of variables allows us to integrate out the Grassmann-valued 
5-functions in the measure (5.55) trivially. The arguments of the (^-functions are the fermionic ADHM 
constraints which, in terms of the new variables, are (4.31a) are 

Cfwi + W a$ C pA + [M' aA , a y = . (6.27) 

These equations can be used to eliminate the 2k 2 variables ( A , for each A. The relevant integral is 
simply 

> 2 C A 1111^^ + ^+ [M' aA ,a> adl ])) = \det 2k W\ k . (6.28) 

r=l a=l 

Notice that the factor on the right-hand side conveniently cancels the Jacobian of (6.26). 



The expression for the supersymmetric volume form on the instanton moduli space in the case 
N > 2k is 

r o3fc 2 /-<(A0 a r N , 

/ u w = t Ak / d 4k2 a'd k2 W°d 4k ^U TT U k{N - 2k) v A d k{N - 2k) 9 A d 2k2 ( A d 2k2 M ,A \ 

J mk VolU(fc) J Al\ / (6 29) 

x |det fc 2 L\ \det 2 kW\ 

This expression for the measure is the starting point for two applications of the multi-instanton calculus 
at large iV that we describe in §VII and §IX. 



VI.2 The ADHM construction for Sp(N) and SO(N) 

The ADHM formalism [7] for constructing instanton solutions was adapted for dealing with any of 
the classical gauge groups in the early instanton literature. The method adopted was to consider the 
construction for one of the series of classical groups, e.g. symplectic groups in Ref. [56] and orthogonal 
groups in Ref. [57], and then embed the other two series in this series. Our approach will be no different, 
although we will start from the unitary series. The advantage of this is that all the previous formulae 
that we have established in the SU(iV) case can easily be adapted to describe the other gauge groups. 

In order to construct instanton solutions for gauge theories with Sp(iV) and SO(iV) gauge groups 
we use the embeddings 40 

Sp(iV) C SU(2iV) , SO(A0 C SU(iV) , (6.30) 

39 We define the integrals over the k x k matrices ( aA and C, aA with respect to the Hermitian basis T r introduced in 
§11.4.1. 

40 For the orthogonal groups we restrict N > 4. 
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to extract the ADHM formalism for these groups in terms of the SU(iV) ADHM construction. The 
surprising feature of the resulting formalism is that the auxiliary group, U(k) in the SU(iV) case and 
denoted generally as H(k) at instanton number k, is not in the same series as the gauge group G. Table 
6.1 shows the auxiliary groups and defines the quantities N' and k' allowing us to present a unified 
treatment of Sp(iV) and SO(iV). 



G 


SU(iV) 


Sp(iV) 


SO(iV) 


H{k) 


U(k) 


0(k) 


Sp(fc) 


N' 


N 


2N 


N 


k' 


k 


k 


2k 



Table 6.1: Gauge and associated auxiliary groups 



To describe the other classical groups we start with the theory with gauge group SU(iV') at instanton 
instanton number k! . Instanton solutions in the Sp(iV) and SO(iV) theories follow by simply imposing 
certain reality conditions on the ADHM construction of the SU(iV') theory which ensures that the gauge 
potential lies in the appropriate sp(N) and so(N) subalgebra of su(N'). In order to deal with both 
the Sp(iV) and SO(iV) case at the same time it is useful to define the notion of a generalized transpose 
operation denoted t which acts either on gauge or instanton indices. Specifically, on Sp(n) group indices 
t acts as a symplectic transpose, i.e. on a column vector v, v l = v T J T , where J is 2n x 2n the symplectic 
matrix 




(6.31) 



while on O(n) group indices t is a conventional transpose t = T. The adjoint representations of 
both groups are Hermitian t-anti-symmetric matrices and have dimensions n(2n + 1) and n(n — l)/2, 
respectively. Hermitian t-symmetric matrices correspond to the anti- symmetric representation of Sp(n), 
with dimension n(2n — 1) and the symmetric representation of SO(ra), with dimensions n{n + l)/2. 

The additional reality conditions on the ADHM variables are 

™ a = ^(w $ Y , (a' a J = a' aa . (6.32) 

These reality conditions are only preserved by the subgroup H(k) C U(A; / ) of the auxiliary symmetry 
group of the SU(iV') theory. The matrices a' n are Hermitian and, by (6.32), t-symmetric, i.e. real 
symmetric in the case of auxiliary group O(k), and symplectic anti-symmetric in the case of auxiliary 
group Sp(/c). It is easy to verify that the ADHM constraints (2.65) themselves are anti-Hermitian 
t-anti-symmetric, in other words H(k) adjoint-valued. It is straightforward to show that these reality 
conditions are precisely what is required to render the gauge field (2.49) t-anti-symmetric, in other 
words to restrict it to an sp(N) and so(N) subalgebra of su(N'), respectively. 

The Grassmann collective coordinates are subject to a similar set of reality conditions: 

= (M'J = M' a . (6.33) 
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So M' a is t-symmetric. The fermionic ADHM constraints (4.34) are, like their bosonic counterparts 
t-anti-symmetric. 41 

We can now count the number of c- number and Grassmann collective coordinates. For both Sp(iV) 
and SO(iV) at instanton number k there are 4kN real independent w variables, taking into account 
the reality conditions. The number of a' n variables is 4 x k(k + l)/2 and 4 x k(2k — 1), for Sp(iV) 
and SO(iV), respectively. The physical moduli space is then the space of these variables modulo the 
three if (A;)-valued ADHM constraints (2.65) and auxiliary H(k) symmetry. Hence the dimension of the 
physical moduli space is Ak(N + 1) and Ak(N — 2), for Sp(iV) and SO(iV), respectively. This agrees 
with the counting via the Index Theorem. The counting of the Grassmann sector of the physical moduli 
space goes as follows. There are 2k N real degrees-of-freedom in /x and 2 x k(k + l)/2 and 2 x k(2k — 1), 
in Ai' a , for Sp(iV) and SO(iV), respectively. The ADHM constraints then impose 2 x k(k — l)/2 and 
2 x k(2k + 1) conditions, for Sp(iV) and SO(iV), respectively. Hence there are 2k(N + 1) and 2k(N — 2) 
real physical Grassmann collective coordinates for Sp(iV) and SO(iV), respectively. Again this agrees 
with the counting via the Index Theorem. 

As an example, consider the Sp(l) theory. Since Sp(l) ~ SU(2), this should have the same content 
as the SU(2) ADHM construction that we have described in §11.4; however, we will find that the Sp(l) 
description is more economical in that there are fewer ADHM variables subject to fewer constraints for a 
given instanton number. This makes the Sp(l) formalism particularly attractive in certain applications 
(for instance in §VIII). In this case, the reality conditions (6.32) are explicitly 

Kia = ea$J uvW ui p , (a'Jij = (a'Jji . (6.34) 

Given that a' n are Hermitian, the second condition implies that a' n are real symmetric k x k matrices. 
We now write the gauge group indices as a — 1, 2 rather than u = 1,2. In this case the first condition 
becomes 

<«* = e^w ip$ . (6.35) 

This means that Wi are quaternions of the form (2.9), so Wi a a = Wi n a na a for real Wi n . The fact that the 
gauge indices are also labelled by a means that the ADHM variable can be written as a quaternion: 

a«a = ( W n r) . (6.36) 

At instanton number k, there are 2k(k + 3) variables a a „ subject to 3k(k— l)/2 ADHM constraints and 
k(k — l)/2 symmetries to give the dimension of QJTfc as 8k. This compares with the SU(2) description, 
where there are 4k(k + 2) variables subject to 3k 2 ADHM constraints and k 2 symmetries. Clearly 
the Sp(l) formalism is more economical. For example, at the one-instanton level there are no ADHM 
constraints in the Sp(l) formalism compared with three in the SU(2) formalism. 

In general all the formulae that we have established in the instanton calculus carry through to the 
Sp(A^) and SO(A^) cases without change. The only difference is the ADHM variables are subject to the 
reality conditions (6.32) and (6.33). In particular, the collective coordinate integration measure and 
the instanton effective action have the same form established in §V. Only the normalization constant 
for the integral over the instanton moduli space C (A ° is changed. 
41 In proving this it is useful to notice that (u4)* = —Wu and (^*)* = — \i. 
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VI. 3 Matter fields and the ADHM construction 



In this section we consider various aspects of the instanton calculus in supersymmetric theories with 
matter fields. We will only consider matter in the fundamental representation of the gauge group 
(although the ADHM formalism can be extended to any higher representation by the tensor product 
formalism developed in [18]). There are two distinct applications: to M = 1 theories on the Higgs 
branch and M = 2 theories on the Coulomb branch. 



VI. 3.1 Af — 1 theories on the Higgs branch 



Let us start by considering a theory with the M = 1 vector multiplet coupled to a a single fundamental 
chiral multiplet Q = {q,x)> where q is the scalar (Higgs) field and x is the Weyl fermion partner 
(Higgsino). The Euclidean space action for the matter fields (with no super-potential) is 

^matter = J d 4 x \v n q^V n q - V n xp n x - V^igxM + V2igq* \\ + |fi' 2 (<? t <?) 2 } • (6.37) 

On the Higgs branch, the scalar field will have an arbitrary VEV q° and the instantons become con- 
strained. As previously we can capture the leading-order behaviour in the semi-classical limit by an 
appropriate approximate instanton solution. As on the Coulomb branch of the M = 2 theories discussed 
in §IV.4, the anti-chiral fermions are zero for the approximate instanton: A = x = 0. To leading order, 
therefore, the fundamental-valued chiral fermion x satisfies the source free Weyl equation in the ADHM 
instanton background 

fx = • (6.38) 
The fundamental fermion zero modes were originally constructed in [18]. In our language, they read: 

Xa = g- 1/2 Ub a flC , (6.39) 

where /Q are k new Grassmann collective coordinates. In the case a fundamental fermion there are 
no analogues of the fermionic ADHM constraints for the new Grassmann coordinates /C. So there are 
k independent zero modes which agrees with counting via the Index Theorem. The proof that (6.39) 
satisfies (6.38) is a straightforward exercise in ADHM algebra, using (2.51), (C.19) and (C.2a): 

P" a Xa =g- 1 ' 2 <%*{a n UA $ ftfb a f + Ub a d n f)K 

(6.40) 

=9- 1/2 5* a U( - bafAWv^f - b^fm a f))C = . 

On the other hand, to leading order the Higgs field q satisfies an inhomogeneous covariant Laplace 
equation: 

V 2 q = V2ig\ X (6.41) 
together with the VEV boundary conditions 

q x =°° q° , (6.42) 



88 



where q° = . . . , q%) denotes the fundamental VEV. The right-hand side of Eq. (6.41) is the product 
of the classical configurations (4.29) and (6.39), respectively. The general solution to Eqs. (6.41)-(6.42) 
is 

Q = U - ^UMfJC . (6.43) 

The proof is another straightforward exercise in ADHM algebra (very similar to that in Appendix C 
Eq. (C.25)). Firstly similar to (C.3b), 

V 2 (UJ) = ~AUb a fb a J + Ud 2 J - 2Ub a fa na ^d n J . (6.44) 

Now take J = — jMfIC and compare with 

g\ X = U(Mfb a - b a fM)Vb a flC . (6.45) 

Using the differentiation formulae (C.2b) and (C.2a) along with A^M. = —MAa and b a M = Mb a , 
one finds that the 2nd term in (6.44) matches the 1st term in (6.45), while the sum of the 1st and 3rd 
terms in (6.44) matches the 2nd term in (6.45). 

As usual in the supersymmetric instanton calculus, q^ ceases to be the conjugate of q in the instanton 
background, since it satisfies V 2 q i = to leading order. Consequently 

^ = ((<?%, l2k] )U. (6.46) 

Notice that q^ fails to be the conjugate of q due to the presence of terms bi-linear in the Grassmann 
collective coordinates in the latter. This is a — by now familiar — symptom of working in Euclidean 
space. 



To capture the leading order behaviour we have to take the approximate instanton solution, including 
the matter field solutions discussed above, and substitute them into the Euclidean action. The gauge 
field term in the component Lagrangian yields —lixir as always. Following the method of Refs. [48,58], 
the two other relevant terms of the action, namely the Higgs kinetic term and the Yukawa interaction 
involving the chiral fermions are turned into a surface term with an integration by parts in the former 
together with the Euler-Lagrange equation (6.41) for the fundamental scalar: 

S = Jd^x [vJV n q + iV2q*\ X } = j d A x {d n (q*V n q) +q\-V 2 q+ V2ig\ X ) } . (6.47) 



Using Gauss's Theorem, the contribution to the action may then be extracted from the asymptotic 
fall-off at infinity 

^.l-^^V + i^) (6.48) 
and hence to lowest order the instanton effective action is 

S = n 2 [( q yw„w & q° + ^(?°)VC} • (6-49) 

This A;-instanton formula, although written in ADHM collective coordinates, is nevertheless easily com- 
pared with the one-instanton expression for the action with SU(2) gauge symmetry found in Ref. [38]. 
The first term in parentheses is equivalent to ^ I <?° 1 2 P 2 , summed over the k different instantons, where 
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Pi is the scale size of the i th instanton. Also the second term in parentheses is the fermion bi-linear 
necessary to promote this p 2 to the supersymmetric invariant scale size constructed in [38]. Indepen- 
dent of one's choice of collective coordinates, the presence of the VEV in the instanton effective action 
(6.49) gives a natural cut-off to the integrations over instanton scale sizes [2], leading to an infrared-safe 
application of instanton calculus. 

The expressions given above may be immediately extended to phenomenologically more interesting 
models with N F fundamental flavors of Dirac fermions (known as a hypermultiplet). In this case the 
gauge multiplet is minimally coupled to 2N F chiral superfields Qf and Qf, 1 < / < N F , where Qf 
transforms in the N and Qf in the N representation of the gauge group. We will take Q (Q) to be an 
N x N F (N F x N), but for greater clarity we will often write the flavour indices explicitly. The action 
for the matter fields is 



'matter 



J d 4 x \v n q^V n q + V n qV n q ] - T> n xa n X + X^nVnX 
- V2igx\q + i\Plq ] \x + VZigqXx - V^igxXq 1 + \g 2 {q ] q - gg 1 ") 2 } 
The classical moduli space of the theory in the Higgs phase is given by [59,60]: 



(6.50) 
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(6.51) 



The VEV matrices in Eq. (6.51) correspond to the cases N F < N. The cases N F > N are similar except 
that the VEV matrices have extra rows of zeroes rather than columns, or vice-versa. These VEVs are 
not all independent; the D-flatness condition requires that for each value of /, 



\Vf\ 



\v f \ 2 + a 2 N F > N , 
\v f \ 2 N F < N , 



(6.52) 



where a 2 is an arbitrary constant, independent of the color index u. 

Now Eqs. (6.39) and (6.41) generalize to 42 

Xaf = g~ 1/2 Ub a fJC f , X fa = g- l/2 K- f fb a U 



and 



q f = U 







^UMfK,, q f =(q° f 0)U + j-iC f fMU, 



(6.53) 
(6.54) 



respectively. The leading contribution to the instanton effective action can be worked out in a completely 
analogous way to (6.49), yielding 



N F 



(6.55) 



42 



Here, and in the following, one should not confuse the flavour subscript / with the ADHM quantity /. 
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The Af = 1 supersymmetry transformation properties of the ADHM variables were constructed in §IV.5. 
To check the invariance of the expression (6.55), it is necessary as well to derive the transformation prop- 
erties for the Grassmann collective coordinates /C and /C associated with the fundamental fermions. As 
with the other collective coordinates, this may be straightforwardly accomplished by equating "active" 
and "passive" supersymmetry transformations on the Higgsinos x an d X- 111 this way one obtains: 



(6.56) 



It is now easily checked that the action (6.55) is invariant under the supersymmetry transformations 
(4.68), (4.70) and (6.56). 

It remains to determine how one integrates over the new Grassmann collective coordinates in the 
semi-classical approximation of the functional integral. As for the adjoint- valued fields, the non-zero 
mode fluctuation determinants cancel, up to a power of the Pauli-Villars mass scale fj,~ 2kNF . The 
measure for integrating over the matter zero modes follows from the inner-product formula 

/N F N F N F 2 N F 

d 4 x XfXf = Q- 1 / *x £ KffbaVPK, = --/^^£ / D//C / = ^ K } Kj , (6.57) 
f=i J f=i 9 J f =1 9 f=l 

employing (C.2b). The integration measure for the matter Grassmann collective coordinates is then 

n -2kN FpL -kN Fg kN F J d kN FlCd kN F £^ ( 6 58 ) 



For convenience, we define the combination 



^,N F) = n -2kN F / ^(AO . d kNF Kd kNF iC 

<m k Jm k 
and then a generalization of the instanton partition function (5.16) 



(6.59) 



Z 



(A^,JV F ) 



ID 



(M,N F )-S 



(6.60) 



to include the matter fields. 



We will see that the Af = 1 action (6.55) possesses two simplifying properties that the Af = 2 
action, to be discussed below, does not. First, Eq. (6.55) has the form of a disconnected sum of k single 
instantons; with the choice of these ADHM coordinates there is no interaction between them. Second, 
the only gaugino modes that are lifted (i.e., that appear in the action) are those associated with the top 
elements fi and p, of the collective coordinate matrices A4 and A4. This leaves 0(k) unlifted gaugino 
modes after one implements the fermionic constraints (4.34). This counting contrasts sharply with 
the Af = 2 theories in which the number of unlifted modes is independent of the winding number k. 
Saturating each of these unlifted modes with an anti-Higgsino as per Affleck, Dine and Seiberg [50] one 
sees that unlike the M = 2 theory, here the sectors of different topological number cannot interfere with 
one another, since the corresponding Green's functions are distinguished by different (anti-)fermion 
content. 
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VI. 3. 2 Af = 2 theories on the Coulomb branch 



Now we consider Af = 2 theories with N F flavors of fundamental hypermultiplets. Each such hypermul- 
tiplet comprises a pair of J\f = 1 chiral multiplets, Qf and Qf, with the same conventions for component 
fields as in the Af = 1 case discussed in the last section. In Af = 1 language, these matter fields couple 
to the gauge multiplet via a superpotential, 

Np 

W = V2gJ2Qf®Qf ■ (6-61) 
/=i 

In component form, the matter fields have the Euclidean action 

'S'mattcr 

- V2igxM + iV2q ] \x + V^igqXx ~ V2igx\cp - V2gx^q - V^gqifx ( 6 - 62 ^ 
- 9X<PX ~ V2gq j 4>X - V2gx4>q j - gxftx] + ^scalar , 

where Scalar are the interaction terms between the scalar fields whose explicit form we do not need. 
In what follows, we will restrict our attention to the Coulomb branch of the Af = 2 theory where the 
hypermultiplet squarks do not acquire VEVs. The classical component fields X/? X/> Qf an d Qf are still 
given by Eqs. (6.53)-(6.54), except that on the Coulomb branch the first terms on the right-hand sides 
of Eq. (6.54) are zero. The adjoint- valued fermions have their usual ADHM form A = g~ 1 / 2 A(A4 1 ) and 
if = g~ l l 2 h(A4. 2 ). To leading order, the scalar field satisfies the same equation-of-motion as in the 
pure Af = 2 gauge theory and so the solution follows from Eqs. (4.64) and (4.65) (along with definition 
of in (4.20)): 43 

<j>=iUM A fM A U + u( (f) ° )u, (6.63) 

where 

up = L 1 ( - | M A M A + . (6.64) 

On the other hand the anti-holomorphic component now satisfies the inhomogeneous equation 

2>V = -9XX ■ (6.65) 
The solution of this equation is readily shown to be 

0t = f/ ^ Ot )u, (6.66) 

where 

Np 

V 9t = ^ 1 (-i^/C / ^ / + ^0°W) . (6.67) 



43 In the following when discussing the M = 2 theory, we use the e tensor to raise and lower SU(2) i?-symmetry spinor 
indices in the usual way following the conventions of [47]. 
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Now we consider the matter fields themselves. The new feature in the J\f = 2 theory is that the fields 
and $ now satisfy the non-trivial equations 44 

£>V = -V2gxip , £>V = -V2gipx (6.68) 

with solutions 

9/ = ^ff M2u > 9/ = £fiM*fK f . (6.69) 

As usual, only the kinetic terms of the scalar fields and the Yukawa interactions involving the 
fermions (rather than the anti-chiral fermions) contribute at leading order to the instanton effective 
action: 



S= J d 4 x \v n tfV n <t> + V n q*V n q + V n qV n tf + 2ig[tf, 



(6.70) 

+ V^igq^Xx - V2igxM j - ^gX^Q - V2gqipX ~ 9X<PXj ■ 

This can be simplified by integrating the kinetic terms by parts and using the equations-of-motion for 
the scalar fields: 

S = J d A x [d n (^V n <f>) + d n (JV n q) + d n (qV n tf) - gx(V2i\tf + ^q + X )} , (6.71) 

The first three terms are converted into surface terms on a large sphere at infinity. Only has a VEV, 
so only the first term is none vanishing. Using 



V„0 > 



l -\2iii A Ji A + WaW^cf) + <j>°w & w* - 2w^w^ (6.72) 



X X' 

we find the contribution to S: 

4vr 2 tr fc { - i/xVW + w & \4> Q \ 2 w& - <pw & <jFw & } . (6.73) 

The contribution from the remaining Yukawa interactions can be evaluated in a similar way to 
the manipulations required to evaluate the instanton effective action in §V.2 (see also Appendix C 
Eq. (C.34) and [61]). The strategy is to write 

y/2i\q* + V2^jq + <j) X = 9~ 1/2 + ©) , (6.74) 
where 6 is a fundamental zero mode, f>Q = 0. We now verify that the solution for T and is 

Tj = -{UM A fA & M A fJC f + U ^A d //C/ (6.75) 

and 

e f = Ub a f<plC f . (6.76) 

44 As usual, to leading order where we set the anti-chiral fermions to zero and ignore the potential terms for the scalar 
fields. 
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First of all, using (2.51), the left-hand side of (6.74) is 

g 1/2 (y/2i\tf + V2^jq + <Px) =p{M A fb a VM A - b a fM A VM A 

-i _ fAfl n\ (6.77) 

+ M A fM A vb a \fK: f + ur o u )vb a f]C f . 

Then using the formula 

V a a(UJ) = Ud aA J - 2Ub a fAaJ , (6.78) 

we have 

Paa^ = ^ AB u[bJM A (l - V)M A + M A fb a VM A + M A fM A Vb a }flC f 
In the last term we can now use the identity (C.32), along with the expression for ip in (6.64): 



(6.79) 



p> 



&a = -hM A M A + {<P,r 1 } ■ (6.80) 



This simplifies (6.79) 

^ Q .T d = |C/{ - b a fM A VM A + M A fb a VM A + M A fM A Vb a ]fK f 

- (dP o\ - ' ( 6 - 81 ) 

+ U [I p) Vbaf}Cf + WoWf ■ 

Now one can see that apart from the last term, which is the fundamental zero mode O, this is precisely 
(6.77). So the contribution to S from the Yukawa terms is given by 



-g 1 ' 2 j d'xxQ, (6.82) 



which can be evaluated using the inner-product formula (6.57): 

Np 2 N F 

-7T 2 J2K f <plC f = ^tr k [( J2 KfKf) L 1 ( - \M A M A + w"<f>°Wa)] ■ (6.83) 

/=1 9 f=l 

Summing the result with (6.73) gives the leading order expression for the instanton effective action: 
1 S = 47r 2 tr fe <j -iji A <iFtJi A + w & \<i> \ 2 w & 



(6.84) 

+ (i J2 K f^f - ^^ w ») L 1 ( - I M A M A + w & <jPw^} 

As with the Af = 1 action (6.55), one can check that this expression is a supersymmetric invariant. 
Notice on the Coulomb branch, Eq. (6.56) collapses to 

SK f = 5JC f = 0. (6.85) 



We can also add Af = 2 preserving masses for the hypermultiplets as explained in §VI.4 below. 
Finally, the collective coordinate integration measure for the Grassmann collective coordinates is given 
by the same expression, Eq. (6.58), as in the Af = 1 theory. 
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VI. 4 Masses 



In certain circumstances one wants to add masses for some of the fields. Here we assess the affects 
on the instanton calculus. It turns out that the effect will be simple to incorporate when working to 
leading order in the semi-classical approximation. 

The interesting cases involve M = 1 preserving mass terms in either M = 2 or M = 4 theories. The 
simplest case consists of a mass term for a fundamental hypermultiplet transforming in the (N, N) as 
described in §VI.3. When Wick rotated to Euclidean space, the conventional mass term in components 
is 

S'mass = / d 4 x jmxx + m*xx + 2\m\ 2 q^q + 2|m| 2 gg t | . (6.86) 

In principle, these mass terms contribute new terms to the equations-of-motion which will affect the 
instanton solutions themselves. In applications one is typically interested quantities that are known 
to have holomorphic dependence on the mass. We then expect that the semi-classical approximation 
respects this dependence. 45 Hence for a holomorphic quantity, we can treat m and m* as independent 
variables and then set m* = 0. The mass terms then only affect the equations-of-motion of the anti-chiral 
fermions x an d X an d t ne supersymmetric instanton remains unaffected to leading order. Hence, we 
simply have to evaluate the first term in Eq. (6.86) in the background of the supersymmetric instanton 
and add the resulting expression to the instanton effective action S. The contribution can be determined 
using the inner-product formula (6.57). For N F such hypermultiplets 

7T 2 NF 

Smass - — ^ rrifiCffCf . (6.87) 
9 /=i 



Now we turn to masses for adjoint chiral superfields. As discussed above only the mass term for the 
chiral fermions is relevant. In the M = 4 theory the most general mass term for the chiral fermions is 
of the form 

S mass = [ d 4 xm AB tr N \ A \ B . (6.88) 



where tuab is a symmetric matrix which in a suitable basis is diagonal tuab = diag(mi, rri2, m^, m 4 ). In 
the general case, all the supersymmetry is broken, while if one, respectively two, of the eigenvalues vanish 
then the coupling arise from mass terms that preserve Af = I, respectively Af = 2, supersymmetry. To 
find the effect on the instanton effective action we substitute the ADHM form \ A = g^ 1 ^ 2 A(M A ) and 
evaluate the integral using Corrigan's inner-product formula (4.37): 

£ mass = -^^tr fc + l)M B = -^^tr fc [2/2 V + M' aA M'*} ■ (6.89) 



45 It should be possible to prove this by showing that the relevant Ward identity is respected. Such a proof would follow 
the lines of §VII.l which considers a different Ward identity. 
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VI. 5 The instanton partition function 



For many applications and for later conceptual developments it is useful to re-formulate the collective 
coordinate integral, or instanton partition function (5.16), by introducing some auxiliary variables which 
have the effect of the "linearizing" problem in a sense to be described. To start with one introduces 
Lagrange multipliers for the bosonic and fermionic ADHM 5-function constraints and also, for M > 1, 
other additional auxiliary variables. In Chapter X we will see for M > 1 this linearized form of 
the collective coordinate integral has an important relation to higher- dimensional field theories. This 
point-of-view will also be the starting point for understanding the relation of the instanton calculus to 
D-branes in string theory as described in §X.3. 

In the linearized formalism one introduces auxiliary variables in the form of: Xa a 2(Af — l)-vector of 

— * 

Hermitian k x k matrices; D, a 3- vector of Hermitian k x k matrices; and k x k matrices of Grassmann 
superpartners A = 1, . . . , Af. The instanton partition function (5.16) can be written as 

= y J lu(fc) k J d^ N+k ^a d 3k2 Dd 2 ^ k2 X J] d 2k ^M A d 2k2 i, A e~ s , (6.90) 

where the instanton effective action is 

S = 4nhl k {xaL X a + \^aA B M A M B X a) + S L .m. , (6.91) 

where 

Sum. = -4tn 2 tT k ^(M A a„ + d«M A ) + D • f^A*} . (6.92) 

The previous form of the collective coordinate integral (5.55) is recovered by integrating out the auxiliary 
variables {xa,D,vjj A }. Specifically, integrating out the Lagrange multipliers D and Tp A yield the 5- 
functions in (5.55) imposing the ADHM constraints and their Grassmann analogues, (2.65) and (4.34). 
The Gaussian integrals over Xa yield the appropriate power of det fc 2.L in (5.55) as well as the quadrilinear 
couplings of Grassmann collective coordinates of the N = 4 theory (5.26). 

One of the advantages of the linearized form of the partition function is that it is straightforward 
to incorporate scalar VEVs on the Coulomb branch in the M = 2 and M = 4 theories. One simply 
generalizes the following couplings in the instanton effective action S: 

WaXa -> WaXa + 4%U) & , XaW° -> XaW° + , ^ 

V A Xa -> V\a + ^ , Xafi A ~> Xa^ + ^<\>l ■ 

The effective instanton action is now explicitly 

S = 47T 2 tr fc { |^ X a + 0>a|MXaX] 2 + |£aAB^ , (6.94) 

where we used the definition of L in (2.125). The supersymmetry transformations on the linearized 

system are 

Ka = ih A M' A , 8M' A = -2iE^gte*> Xa] , (6.95a) 

Sw A = i&AV A , 5fi A = -2iZ AB & (w«Xa + <f>° a w & ) , (6.95b) 

5 Xa = -V AB l«A$% , 5r A = 2Z abA B {xa, Xb]& ~ iD ■ r^g (6.95c) 

5D = -iT A fFt B Z &B ®i, Xa] ■ (6.95d) 
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On integrating out the auxiliary variables {x a , D, iPa}, these transformations reduce to those constructed 
in §IV.5. 



In the M = 2 theory it rather simple to incorporate fundamental hypermultiplets. The instanton 
effective action, generalizing (6.94), is 

S = 47T 2 tr fc | \WaXa + 0>d| 2 ~ [Xa, dj + \Jl A {HAx} + 

N ? . (6.96) 
+ IM' A M' AX ] + lY^K-fiCAx ~ g-'mf)} + S L . m . , 
/=i 

where we have defined 

0° = 0?-*0°, x = Xl -i X2 (6.97) 

and we have allowed for arbitrary hypermultiplet masses (6.87). Integrating out Xa gives the instanton 
effective action (6.84). The supersymmetry transformations (6.95a)-(6.95d) are then augmented with 
(6.85). 

In applications of the instanton calculus it is useful to define the notion of the "centred instanton 
partition function". This is defined in terms of an integral over the centred instanton moduli space 
DJlk, (2.16), where the overall position coordinates and their superpartners, the Grassmann collective 
coordinates for the supersymmetries broken by the bosonic instanton solution (see §IV.2.3), 

X n = -k-Hr k a' n , £ A = lk-hr k M' A , (6.98) 

have been factored off. First of all, from the expression for the metric (2.66) we have 

ds 2 mk = 8n 2 k dX n dX n + ds^ . (6.99) 

Therefore taking account of the normalization of the measure in the non-supersymmetric theory (3.16), 
we have 

/ w = / (A7tk) 2 d 4 X ■ / w . (6.100) 

Now we consider the supersymmetric integral. The inner-product of the supersymmetric zero modes 
(4.43) is from (4.37) 

J rf 4 xtr w A(-4^ A l WxW )A(-4^ B l W x W ) = l^ 2 ki A i B . (6.101) 

This means 

f = [ (4nk) 2 d' l X f[(32n 2 k)- 1 d 2 ^ A ■ [ u> (M ' N ^ . (6.102) 

Jm k J A=1 Jm k 

Since the instanton effective action is always independent of X n and t; A , we can define the centred 
instanton partition function, generalizing (6.60) as 

(Af,iV F ) _ f r (M,N F )-S 



Z^>"r> = / ^ N ' N ^e- b . (6.103) 
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VII The Gluino Condensate in J\f = 1 Theories 



Pure Af = 1 gauge theory is in some respects the simplest supersymmetric gauge theory; however, a 
puzzle arose in the mid-1980's over the numerical value of the gluino condensate. In this section, we 
will describe how the puzzle — described in detail below — can be resolved using the calculus of many 
instantons and, in particular, by exploiting the simplifications that occur in the large- N limit. 

It is well established that M = 1 supersymmetric gauge theory has N physically equivalent vacua, 
as dictated by the Witten Index, which differ by the phase of the gluino condensate. By dimensional 
analysis one expects 

'^tr N \ 2 ) = c AV™^ , u = 1, . . . , N , (7.1) 

where A is the dynamical scale in the theory while c is a numerical constant. In particular, by a 
powerful supersymmetric non-renormalization theorem there can be no perturbative corrections to the 
form (7.1) since this would be a series in g and hence would not be holomorphic in A. 46 This suggests 
that if c can be calculated to leading semi-classical order then it will be exact. Remarkably, there 
are two approaches in the literature for calculating the gluino condensate each superficially appearing 
to be exact but which differ in their predictions of the constant c. This disagreement is especially 
vexing in light of the fact that both involve the use of a single supersymmetric instanton. The first, we 
shall call the "strong-coupling instanton" (SCI) approach [10,38,62-64], while the second, we call the 
"weak-coupling instanton" (WCI) approach [50,64-66]. 

In the SCI approach one calculates the instanton contributions to the correlation functions of g 2 tr^X 2 
directly in the strongly-coupled confining phase of the theory. Strictly speaking, it is therefore not a 
semi-classical calculation even though it uses instantons and ultimately we will conclude that this is 
why the SCI approach is flawed. In an instanton background, each insertion of g 2 trjvA 2 is quadratic in 
the Grassmann collective coordinates and since the charge-A; instanton has 2kN fermion zero modes, 
the latter can only contribute to the /ciV-point function. 47 Furthermore, by a Ward identity, which 
we prove in §VII.l is respected in the instanton approximation, the kN-point correlation function is 
independent of the kN spacetime insertion points {x®}. The one-instanton calculation of the Appoint 
function, reviewed in §VII.2.1, can be done exactly and yields the result 

£tr w A V>) x ... x £tr w A V*")) = (N _ ^ _ 1} ^ ■ (7.2) 

In order to extract (g 2 trArA 2 ) from the correlator (7.2), one then invokes cluster decomposition: taking 
| x 0) _ X U)\ ^> A" 1 , and remembering the independence of the correlator on {x^}, one simply replaces 
the left-hand side of Eq. (7.2) by (r^2tr N \ 2 } N . Taking the N th root, the net result, based on this 
one-instanton calculation, reads: 



16tt 



tr/vA 2 ) = -™ A 3 e 2 -' N , (7.3) 

/sci [(jV_i)!( 3 jV-l)] 1/JV 



46 In this context, A is a complex quantity carrying with it the phase e l9 / 3N , 

47 This can also be deduced from the relation between A and the running coupling (5.17) A 3N <~ e 2, ™ T (A<) p 
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where u — 1, . . . , N indexes the N vacua of the SU(iV) theory, and reflects the ambiguity in taking the 
jyth roo t_ i n retrospect — as argued in Refs. [10,63] — the reason why the naive instanton calculation of 
(Y^^-tr^A 2 ) gives zero is that the iV vacua are being averaged over with equal weight in the instanton 
approximation and the phases cancel. 48 

In contrast, in the WCI approach, one modifies the pure gauge theory by adding matter superfields 
in such a way that the gauge symmetry is either entirely broken, or only an abelian subgroup remains, 
after the Higgs mechanism. In this way the theory can be rendered weakly coupled in the infra- 
red and semi-classical approaches should be reliable. Here, we choose the former option by adding 
N F = N — 1 matter hypermultiplets in the fundamental representation of the gauge group. In the 
Higgs phase, all the n-point functions of gHriyX 2 receive contributions from constrained instantons. 
In particular, based on fermion zero-mode counting, /c-instantons now contribute uniquely to the k- 
point function. We will argue in §VII.l that, just as for the SCI approach, the supersymmetric Ward 
identity guaranteeing independence of the result on the insertion points, is respected in the instanton 
approximation. In particular, the one instanton contribution to the one-point function can be performed 
exactly, as reviewed in §VII.2.2. The non-renormalization theorems of M = 1 supersymmetric gauge 
theory then permit the analytic continuation of the answer into the confining phase by decoupling 
the extraneous matter fields. This is achieved by giving them a mass m, and taking the joint limit 
m — > oo and A (JVp) — > in the manner dictated by renormalization group decoupling. Matching the 
exact one-instanton calculation onto the effective low-energy theory without matter gives: 

(£,to»*) m = (7.4) 

Note that the renormalization group decoupling procedure forces the low-energy theory into one of the 
N degenerate vacua. The puzzle now reveals itself as the mismatch between (7.3) and (7.4). 

In this Chapter, following Ref. [67], we review this old controversy, using the many-instanton calculus 
that we have developed in previous sections. In particular, we shall calculate the /c-instanton contribu- 
tion to the kN-point correlator, in the SCI approach, and the /c-point function, in the WCI approach, 
in the large- N limit where the solution of the ADHM constraints described in §VI.l in conjunction with 
saddle-point methods simplify the instanton calculus. In a nutshell, our results cast serious doubt on 
the validity of the SCI calculations of the condensate. Specifically, we will demonstrate that in the SCI 
approach cluster decomposition is violated at leading order in 1/JV: 

(^AV 1 ') x • • • x Tfrtr^A V fcA °)) scl N T (l&^A 2 )^ . (7.5) 

On the contrary, we will present a new calculation showing that the WCI approach is perfectly consis- 
tent with clustering at leading order in 1/N . But we shall go further and argue that the WCI approach 
is also consistent with clustering for finite N. The important implications of this observation are as fol- 
lows. Since cluster decomposition is an essential requirement of quantum field theories (with very mild 
assumptions that are certainly met by supersymmetric Yang-Mills), the exact quantum correlators must 
have this property. The fact that cluster decomposition is violated by the instanton-saturated SCI cor- 
relators means that — contrary to claims in the literature — the SCI approximation is only giving part of 

48 The same argument implies that only the fcA^-point functions are non- vanishing, matching the selection rule arising 
from counting fermion zero modes in the fc-instanton background. 
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the full answer. Since the SCI correlators obey supersymmetric perturbative non-renormalization theo- 
rems [65] , it necessarily follows that additional non-perturbative configurations must be contributing to 
the correlators. On the contrary we will argue that the WCI approach, which is a genuine semi-classical 
technique, is consistent with cluster decomposition. We believe that this conclusion is perfectly natural 
and simply underlines the fact that instantons are a semi-classical phenomenon and should only be 
trusted at weak coupling. We should add that there are other non-instanton approaches to calculating 
the gluino condensate, for instance, via softly-broken J\f = 2 gauge theory [67] solved using the theory of 
Seiberg and Witten [68] , or from monopoles acting as instantons in the three-dimensional gauge theory 
that arises after compactification on a circle [69,70]. These alternative approaches all agree with the 
WCI answer. 



VII. 1 A supersymmetric Ward identity 

A fundamental property of correlation functions involving insertions of lowest components of gauge 
invariant chiral superfields, of which tr^A 2 is an example, is that they are independent of the insertion 
points [38,62]. It is of paramount importance for our subsequent arguments that this property is 
preserved within both the SCI and WCI approaches. To start with, let us briefly review the field 
theoretic argument. For any gauge invariant chiral superfield $ = A(x) + y^29ip(x) + • • • one can show 

J^A(x) = far^(x) . (7.6) 

Here, we have defined 5 a via the supersymmetry variation 5 = (, a 5 a + £q<5 q . In particular, 

9 - t r N A 2 = h (<t™ti N F mn X a ) . (7.7) 



dx 



n 



Now consider the derivative of a correlator of lowest-component fields with respect to one of the insertion 
points: 

° (Ax(xW) x • • • x A p (x^)) = \oT <A(* (1) ) • • ■■A p (x^)) . (7.8) 



The supersymmetry variation can then be commuted through the operators to the left and right, since 
for a lowest component of a gauge invariant chiral multiplet 5 a Aj(x) = 0. Furthermore, the vacuum 
is a supersymmetry invariant so the right-hand side of (7.8) vanishes and the correlation function is 
independent of the insertion point {x® . 

The question which we now address is whether the supersymmetric Ward identity described above is 
respected in either the SCI or WCI approaches. In an M = 1 supersymmetric theory, the supersymmetry 
transformation 5 a lifts to the collective coordinates as (4.68) and (4.70): 5 a a^ = i5 a ^M and 5 a M = 0. 
Using (7.7), this means 

- — trivA 2 (a;) = il^ ui - \- Jx iu — h M' ija T— f — )cr ma atr N F mn \ a (7.9) 

OX n V dW u i a UWi ua OQ'ijaa 

and, in addition, 5 a tiN A 2 = 0. In the instanton background, therefore, the derivative of the multi- 
point correlator of g 2 tr N \ 2 is equal to an integral over DJl k of a total derivative. We can use Gauss's 
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Theorem to write this as an integral over the boundary of 371^. The only possible contributions can 
come from the large sphere at infinity or the small spheres surrounding one of the insertion points. To 
judge whether the contributions are non- vanishing we need to determine the asymptotic behaviour of 
the integrand. 

First of all, consider the SCI instanton approach, for which the relevant collective coordinate measure 
is (5.55) with M = 1. Consider the contribution from the sphere of large radius R. By this we mean 
where the ADHM variables scale as 

tr k (w^ + a' n a' n ) R 2 . (7.10) 

The relevant asymptotic behaviour we need is 

trjv A 2 ~ iT 4 , tr N F mn \ ~ iT 4 , (7.11) 

so, collectively, the kN operator insertions scale as R~ 4kN . To complete the analysis we have to deter- 
mine the scaling of the supersymmetric volume form (5.55). There are two sources of R dependence: 
firstly there are Ak(N + k) — 1 integrals over the c-number collective coordinates on the boundary, giving 
ft4k(N+k)-i^ anc j secorLC iiy the the 5-functions for the bosonic and fermionic ADHM constraints scale as 
i?~ 4fc2 . 49 Overall, therefore, the volume form on the boundary scales as i? 4feAr_1 . Putting this together 
with the scaling of the operator insertions, we see that the contribution from the large sphere scales 
as R ikN ~ l x R~ 4kN = l/R and therefore vanishes as R — > oo. Now consider the behaviour on a small 
sphere of radius R around one of the insertion points x^: 

ti k (w a Wa + (a' n + x%h [k]x[k] )(a' n + a# ) l [ k]x[k])) R ~° R 2 ■ (7.12) 



On this sphere, the insertion at scales as i?~ 4 , while the other insertions at other points remain 
finite. 50 The collective coordinate integral over this boundary scales as i? 4fc7V_1 . Therefore, as long 
as we avoid making all the insertions at the same point, there is a vanishing contribution as R — > 0. 
Since there are no contributions from the boundaries, we conclude that the integral vanishes and the 
supersymmetric Ward identity is respected within the SCI approach. 

Now we turn to the same considerations in the WCI approach. In this case, k instantons contribute to 
the /c-point function rather than the kN-pomt function. To leading order in the semi-classical expansion, 
even though the instantons are now constrained, the insertions take their ADHM form and so satisfy 
(7.9). Hence, we use the same logic to write the -derivative of the fc-point function with respect to 
one of the insertion points as a total derivative on Wl k - However, there are some new subtleties. Firstly, 
since we are on the Higgs branch, there is a non-trivial instanton effective action (6.55) characteristic of 
a constrained instanton calculation. Since it is a supersymmetric invariant though, we can pull 5 a past 
e~ s , as well as the other insertions, in order to apply Gauss's Theorem. However, the instanton effective 
action (6.55) will modifies the asymptotic behaviour on the large sphere of radius R. The bosonic terms 

49 The 3k 2 bosonic ADHM constraints are quadratic in the bosonic ADHM variables and so scale as i? _6fc2 while the 
2k 2 fermionic ADHM constraints are linear in the bosonic variables and so scale as R 2k , giving R~ 4k . 

50 Note that this argument is only valid if the insertion points are distinct. We shall find out by explicit calculation 
that ambiguities can arise if too many insertions are made at the same point. 
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in S effectively prevent 4k(N — 1) of the variables {w ui a,wf u } from becoming large on the boundary 
manifesting the fast that the instantons are constrained and there is a cut off on their size. Taking this 
into account, the measure now scales effectively as i? 4fc_1 on the sphere, rather than the _R 4fcAr_1 of the 
SCI approach. Each of the insertions (which are equal to their ADHM expressions to leading order) 
still scale as (7.11) and so the insertions, together, scale as R Ah . Overall, measure and insertions scale 
as 1/R and so there is a vanishing contribution as R — > oo. Likewise it is straightforward to show that 
there are no contributions from around the insertion points themselves. The point is that in this case 
the instantons are small and so the constraining plays no role and the analysis is identical to the SCI 
case above. Consequently the /c-point correlations functions are independent of the insertions points in 
the WCI approach. 



VII. 2 One instanton calculations of the gluino condensate 



In the following subsections, we review the one-instanton SCI and WCI calculations of the gluino 
condensate. 



VII. 2.1 Strong coupling 



The strong-coupling one-instanton calculation was done originally for gauge group SU(2) in [38] and 
then extended to the SU(iV) theories in [63] (see also the very comprehensive review articles [10,12]). 

To begin with the supersymmetric volume form on the instanton moduli space is given in (5.55). 
However, since at the one-instanton level N > 2k = 2, we find it more convenient to use the expression 
given in (6.29) where the ADHM ^-functions have been resolved: 



/ 



2 3N 7T 2N - 2 



m (N-l)\(N-2)\ 



J p 4N - 8 d 4 Xdp 2 d^ N -^Ud 2 M'd 2 CS N - 2 ^S N - 2 ^ . (7.13) 



Here, d 4 ( N ~^U is the unit normalized (J d 4 ( Ar-1 )U = 1) volume form for the gauge orientation of 
the instanton. We have identified X n = —a' n , the position of the instanton, p = a/W^°/2, the scale 
size, and {A4', (} as the Grassmann collective coordinates associated to broken supersymmetric and 
superconformal invariance. In particular, recall from §VI.l, the definitions 

H = w & C + u, // (,,-,«•''' • v . (7.14) 

By explicit evaluation, using (4.29) and the one-instanton formulae of §11.4.3, the gluino insertions in 
the one-instanton background are 

g tr N \ (x) = oH — j -. (7.15) 

{(x-xy + p*) 3 ( (x _x)2 + p 2) 4 

We then insert Ylf =1 -^jtTN\ 2 (x^) into the collective coordinate integral. Since in the confining phase 
there is no symmetry breaking and the insertions are gauge invariant there is no dependence on the 
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gauge orientation of the instanton and one can simply integrate over the associated collective coordinates 
J d^ N ~ lS) \i = 1. Next, let us carry out the Grassmann integrations. Obviously the £ and M! Grassmann 
integrals have to be saturated from the insertions at two points {x^\ x^} chosen from amongst the N. 
After integrating over ( and Ai', the contribution from this pair is 

36p 8 (*<*> - x«) 2 



((xW - X) 2 + p 2 ) 4 ((x0') - X) 2 + p 2 ) 4 ' 

Now we take advantage of the fact that the iV-point function is independent of the x^\ to choose the 
insertion points for maximum simplicity of the algebra. The simplest conceivable such choice, #W = 
for all i, turns out to give an ill-defined answer of the form "0 x oo" (the zero coming from the Grassmann 
integrations as follows from Eq. (7.16), and the infinity from divergences in the p 2 integration due to 
coincident poles). In order to sidestep this ambiguity, one chooses instead: 

x W = ... = x^- 1 ) = 0, = (7.17) 

This choice is the simplest which gives a well-defined answer with no "0 x oo" ambiguity. More am- 
bitiously, one can still perform the calculation even if all the insertion points are taken to be arbi- 
trary [10,63]; however, we find it convenient to take the minimal resolution provided by (7.17). Due to 
the (x^ — x^) 2 factor in Eq. (7.16), it follows that the pair of insertions {x^\x^} responsible for the 
{Ai', (} integrations must include the point x^ = x; there are N — 1 possible such pairs, giving 

36(iV-l)^ 2 (7ig) 



((:r-X) 2 + p 2 ) 4 (X 2 + p 2 )^ 



for these contributions. The remaining Grassmann integrations over {u, u} are saturated by the N — 1 
insertions at = 0, and give 

4p 2 



(N-2)\ 



o i N-2 



(X 2 + p 2 )3 



(7.19) 



Combining the denominators in Eqs. (7.18)-(7.19) with a Feynman parameter a, 



- < 3iV+1 > ! (\ a ° 3 ' 1 -°» 3W ' 3 ^ (7.20) 



(A'* + p2)3 W -2 ( (l _ Xf + p 2)4 3, {3N _ 3)! J o ((qx _ x)2 + a(1 _ ^ + p2) 3« + 2 

and performing the X integrals yields: 

T tT N X 2 (x^)x---X 1 ^tT N X 2 (x^) 

pi poo 

/ da / dp 2 (p 2 ) 
Jo Jo 



9 2 
16vr 2 

3(3iV - 2) 2"- V*V^ , f 1 r , ^ ^(l-ar 
^(X-2)! i da i d/? {P) (f + a {l-a)x*) 

3(3X-2)2 Af " 1 p 3JV e 2 " r ^ 

# 2Ar (iV-2)! 
2 n A sn 

(N-l)\ (3N-1) ' 



/ daa 2 (l-a) 3JV 1 



(7.21) 



in agreement with Eqs. (7.2)-(7.3). In the final expression, we have used (5.17) to remove any dependence 
on the running coupling in favour of the A-parameter. 
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VII.2.2 Weak coupling 



Next, let us review the weak coupling instanton calculation of the gluino condensate. These kinds of 
calculation were originally done in [50,64-66] and reviewed in [12]. The strategy here is to add sufficient 
matter fields in order to completely break the gauge group by the Higgs mechanism. For matter fields 
transforming in the fundamental representation, this means we need add Np = N — 1 hypermultiplets, 
that is N — 1 chiral multiplets in both the N and N representations. On the Higgs branch, the scalar 
fields of the chiral multiplets qj and g/, / = 1, . . . , Np gain a VEV. For the case Np = N — 1 we 
can choose the VEVs as in (6.51). For large values of the VEVs, the theory is weakly coupled and 
semi-classical methods should be reliable. In contrast with the SCI calculation, in the weakly-coupled 
Higgs phase ^ 2 trjyA 2 ) receives a non-zero contribution directly at the one-instanton level. Decoupling 
the extraneous matter and matching to the low-energy pure gauge theory is then accomplished using 
standard renormalization group prescriptions. What is absolutely crucial is that, due to holomorphy, 
the value of the the gluino condensate is not renormalized as the matter is decoupled and in this way 
a result calculated at weak coupling can yield a result also valid at strong coupling. 

We now calculate the instanton contribution to the one-point function (jl^-tr^A 2 ) in the theory with 
Np = N — 1. The instanton calculus in theories with fundamental matter fields is described in §VI.3. 
Since the scalar fields in the matter sector have VEVs the calculation involves constrained instantons. 
As we have seen, to leading order (which in the present context is exact) the effect of constraining the 
instantons appears at the level of the instanton effective action which becomes a non-trivial function of 
the collective coordinates as is evident in (6.55). 

The existence of the Np fundamental hypermultiplets leads to new Grassmann collective coordinates 
{/C, 1C\ as described in §VI.3. The new feature of the semi-classical limit are integrals over these new 
collective coordinates Eq. (6.58). To leading order in the semi-classical approximation, the instanton 
measure is obtained by amalgamating (5.14), for J\f = 1, with (6.58): 



where the volume form oj (Ar ' NF) is defined in (6.59). 

In order to calculate the gluino condensate, we must insert into (7.22) (with N F = N — 1) the one 
instanton expression for jf^-tr^A 2 . Note that in contrast to the SCI approach, in this WCI calculation 
a single instanton contributes directly to the one-point function. However, since the matter fields have 
non-vanishing VEVs the instantons are constrained as described in §IV.4. This seems to preclude an 
evaluation of the condensate because we do not actually know the exact profile of A in the constrained 
instanton background. However, we do know the profile of A in the core of the instanton where it is 
well approximated by its ADHM form (4.29). Now suppose we simply use the ADHM expression for 
y^tr^vA 2 (written in (7.15)) have we any right to expect this to be a a good approximation? At first 
it appears not because the ADHM profile behaves differently in the tail of the instanton where it has a 
power-law fall off rather than the exponential fall off of the constrained instanton. However, this would 
only be a problem for large instantons which are in any case suppressed by the instanton effective action 




(7.22) 
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(7.27). Consequently the error made in substituting the ADHM profile for the insertion will be higher 
order in g and as long as we are only after the leading order expression, the ADHM profile will suffice. 
Here is the rub: we know that the condensate cannot receive any corrections in g by holomorphy and 
so the result we obtain will therefore be exact. 

The insertion -^2tr N \ 2 in the background of one instanton is given in (7.15) and is quadratic 
in Grassmann collective coordinates. Since in the measure (7.22) includes an integral over the two 
Grassmann collective coordinates associated to broken supersymmetry transformations M 1 and the 
instanton effective action (7.27) is independent of them, the insertion must saturate these integrals: 

/ d 2 M'tr N \\x)=g^- 6 ( . (7.23) 

The instanton effective action is likewise independent of the position of the instanton X n and so the 
integrals over these collective coordinates give 

d 4 X j = — A . (7.24) 

( (a; _X) 2 + p 2 ) 4 6p 4 



/ 



What remains is then a supersymmetrized integral over the centred moduli space 9DTi. In fact the gluino 
condensate in the case Np = N — 1 can be written simply in terms of the centred instanton partition 
function (6.103) as 

'£,tr N \ 2 ) = lp™+lg-™ e ™r%»=i,» F =»-i) (725) 



(7.26) 



At the one-instanton level, since N > 2k = 2, we can use results from §VI.l to resolve the bosonic 
ADHM constraints. When this has been done the centred one-instanton partition function is 

n3N-3„2N-2-2N F f 
~(M=hN F =N-l) _ z 71 / AN-8 J n 2 .4(iV-lW 

Ll ~ (N-l)\(N-2)\ J P dp d U 

2 

x d N fid N fid NF ICd NF IC \\5(w^ + JiWc) e~~ s , 

d=l 

where the the instanton effective action is (6.55) (with the choice of VEVs (6.51)) 

N-l . 2 N-l 

s = 2ttV £ l^/l 2 (|ix/i| 2 + |u /2 | 2 ) + ^ W^/ - v*Mf) ■ ( 7 - 27 ) 

f= i v z f=i 

Here, we have used the one-instanton form for Wa in terms of the gauge orientation 11 in Eq. (2.94). 
Notice that the form of the measure that we will use (7.26) is rather schizophrenic, since we have chosen 
to resolve the bosonic ADHM constraints as in §VI.l, but not the fermionic ones, as indicated by the 
remaining Grassmann ^-functions in (7.26). 

We now evaluate the partition function explicitly. First of all, the {/C/,/C/} and u ^ N, 

integrals are saturated by pulling down powers of the Yukawa coupling terms in the instanton effective 
action (7.27). This yields a constant term 

N-l 



/ ti-2 N 2(iV-l) 



(7-28) 

/=i 
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The integrals over the two remaining Grassmann variables {[1n,Pn} are saturated by the Grassmann 
(^-functions: 

/ dfi N dfi N Yl 5 (wNa^N + UnWno) = p 2 (\U N1 \ 2 + \U N2 \ 2 ) ■ (7.29) 

J a=l 

Putting all these factor together, we have the remaining bosonic integrals to perform 

_4Af-8o27V-l N ~ 1 r 

zr^— = = {N n _ iy l N _ 2)l n / *xdw-*u 



{N-\)\{N-2)\ Al 

x p 4Ar - 6 (|U Jvl | 2 + lU^I 2 ) exp[-27rVX;|t; / | 2 (|lX /1 | 2 + |U /2 | 2 )] . 

/=i 



(7.30) 



The integral over the scale size is 

JV-l 



r^p-- 6 exp[-2 7 rV^,J(|U /1 | 2 + |lX /2 | 2 )] =- ^-Jl (7 .3l) 



This leaves the remaining integral over the gauge orientation 11 which can be done by using the formulae 
in Ref. [71] : 51 

iwu„i» . ( , 32) 

Collecting the results, we have 

JV-l 

£("=wv-D =2 JJ^)-! (7.33) 
/=i 

and hence 

JV-l 

toA 2 ) = v 2N+1 9- 2N e 2 ^ f[ (w)" 1 . (7.34) 

/=i 



Teb 2 " 1 



The result (7.34) is expressed in terms of bare quantities and involves the Pauli-Villars mass-scale \x. 
We need to re-express it in terms of renormalized quantities. Firstly, the bare VEVs Vf and Vf should 
be replaced by Z^ 2 vJ n ' and Z^vJ^, respectively, where Zf is the usual multiplicative wavefunction 
renormalization factor. From now on we shall drop the "ren." superscripts on the renormalized VEVs 
with the understanding that, henceforth, all quantities are understood to be renormalized. The coupling 
constant g and multiplicative renormalization factors Zf run with the Pauli-Villars mass scale \x in such 
a way that the remaining factors are equal to a certain power of the strong coupling scale of the theory 
in the Pauli-Villars scheme: 

TV— 1 

a 2 ^; = e-^^+v^ 1 ^)- 2 * n z ^y i . ( 7 - 35 ) 



51 



Alternatively, making the choice Vf =vj =v yields an elementary integral. 
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generalizing the N F = relation (5.17). Note that due to holomorphy this relation must be exact. By 
differentiating with respect to \x this yields an exact expression for the /3-function. It is an example of 
the more general relation for arbitrary Np 

N F 
/=1 

which upon differentiation leads to the famous Novikov-Shifman-Vainstein-Zakharov /3-function of su- 
persymmetric QCD [12,72]: 

da a 2 3iV-E£i(l-7/) dlnZ 

^ = -27 T=m ' a = T^ ^ = -7w7 (7 - 37) 

The final result for the gluino condensate is 

JV-l 



9 2 
16tt 2 

/=1 



tr JV A 2 ) = A^+fn(w)" 1 - ( 7 - 38 ) 



We still have to relate the value of the gluino condensate (7.38) in the Higgs phase to that in the 
confining phase. In other words, we have to track the value of the gluino condensate as we decouple 
the matter fields. In order to achieve this in a controlled way, it is useful to re-interpret (7.38) in terms 
of a coupling in the low energy effective superpotential W e fr for the matter fields in the Higgs phase. In 
order to determine the superpotential we exploit the general functional identity (see for example [73]): 

tr N \ 2 ) = 6„ X A A (W cS ) . (7.39) 



16w 



2 1 



Here, A is the appropriate strong coupling scale and b is the first coefficient of the /3-function, equal 
to 3N — N F = 2N + 1 in the present context. Since, the expression for the superpotential must be 
invariant under gauge and global symmetries, it can be uniquely determined: 

W eS = . (7.40) 

det QQ y ! 

This is the famous Affleck-Dine-Seiberg (ADS) superpotential for M = 1 supersymmetric SU(iV) QCD. 
(See Ref. [50] for the SU(2) case and Ref. [74] for the SU(iV) calculation.) Note we have arrived at the 
ADS superpotential by a somewhat different route to Refs. [50,74]. In those references, the form of the 
superpotential was deduced by first noticing that after the Higgs mechanism, there is a combination of 
the matter fields which remains classically massless. The ADS superpotential then implies that these 
classically massless fields receive a non-perturbative mass through a one-instanton effect. This mass, and 
hence the form of the superpotential itself, can be deduced by calculating the long distance behaviour 
of the two-point function of the massless component of the anti-Higgsinos x an d X- The computation 
involves using a single constrained instanton but to leading order we can replace the massless component 
of x an d X by their value in the ADHM instanton background. 52 The net result of this calculation is 
entirely consistent with our approach via the gluino condensate. 

52 Notice that this involves iterating the constrained instanton to the required order where the anti-chiral fermions are 
non-zero. 
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As it stands, the ADS superpotential implies that the vacuum runs away away to infinity. One way 
to stabilize the theory is to add a small mass term. Using the ubiquitous holomorphy argument, the 
mass cannot get renormalized and will appear directly in the effective superpotential: 

^2AT+l N-l 

det QQ JT[ 

The superpotential (7.41) now has well defined critical points at 

mfv f v f = 'P' _ . (7.42) 
\[ f/=1 v f v f 

It is easy to see that there are N solutions where VfVf differ by iV th roots of unity. At these critical 
points the values of the gluino condensate obtained by using (7.39) are 

^M 2 )=(ASn«'/) 1/ ". (7-43) 



16tt 2 

/=1 

th 



which exhibits the A-fold degeneracy explicitly via the choice of the N root. 

The result (7.43) is justified in the weakly-coupled Higgs phase where the VEVs are large (or masses 
small) compared with A (JV _ 1} . However the powerful Ward Identities of supersymmetric theory allow us 
to extrapolate the result into the regime of small VEV (large masses) and strong coupling. The point is 
that since trjy A 2 is gauge-invariant and the lowest component of a chiral superfield, its VEV must be a 
holomorphic expression in the coupling constants of the theory. In the present context this means A (JV _ 1} 
and the masses. This implies that the result (7.43) cannot be subject to any perturbative corrections 
since a power series in g would translate into an expression which could not be holomorphic in A (JV _ 1) . 
Hence, (7.43) must also be valid for small VEVs, or, correspondingly from (7.42), large masses. This 
is precisely the limit in which the matter fields are decoupled and the theory should flow into the pure 
N = 1 supersymmetric gauge theory. In particular the N supersymmetric vacua that we see at weak 
coupling are continuously connected with the N supersymmetric vacua of the pure gauge theory. In 
order to track the value of the gluino condensate from weak to strong coupling we simply have to match 
the A-parameters of the two theories. The correct renormalization group matching in this case is [54, 75] 

N-l 

A 3iV = A™+J J] -/ • (7-44) 
/=i 

Finally putting (7.43) together with (7.44), we deduce 

^tr^A 2 ) = A 3 e 2 ™ /iV , (7.45) 
for u — 1, . . . , N which is the expression in (7.4). 



VII. 3 Multi-instanton calculations of the gluino condensate 

In this section, we calculate the contributions of arbitrary numbers of instantons to the gluino condensate 
in both the SCI and WCI approaches at large N where saddle-point methods are available to simplify 
the instanton calculus. 
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VII. 3.1 Strong coupling 



In §VI.l, we derived a form for the multi-instanton volume form in a supersymmetric gauge theory 
with iV > 2k. In the confining phase there is symmetry breaking and all insertions are gauge invariant, 
hence, we can immediately integrate over the gauge orientation: J d 4k ^ N ~ k ^U = 1. This gauge invariant 
form for the multi-instanton volume form is now particularly useful for taking a large- N limit. 

In order to calculate the /ciV-point correlation function, we must insert into the measure the expres- 
sion for yl^-tr^A 2 evaluated in the k ADHM instanton background. At the multi-instanton level we 
find it useful to use the identity (C.22) to express the insertions as 

tr^A 2 = -^ntr k M(V + l)Mf . (7.46) 

As proved earlier, the correlation function is expected to be independent of the insertion points and 
so, as in the one-instanton sector in §VII.l, the {x^} can therefore be chosen for maximum simplicity 
of the algebra. The simplest conceivable choice, = for all /, results in an ill-defined answer of 
the form "0 x oo" (the zero coming from unsaturated Grassmann integrations, and the infinity from 
divergences in the bosonic integrations due to coincident poles); we have already noted this fact in the 
one-instanton sector in §VII.2.1. The simplest choice of the {x^} that avoids this problem, generalizing 
(7.17), is 

x (l) = ... = x (kN-k*) = ) 

x (kN-k 2 +l) = . . . = x (kN) = x ( 7 - 47 ) 



which we adopt for the remainder of this section 
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In the large- N limit, there is a large preponderance of insertions (7.47) at = 0, and the resulting 
factor of (tr N \ 2 (0)) k ( N ~ k \ taken together with the Jacobian factor \det 2 kW\ N ~ 2k from the measure 
(6.29), dominate the integral and can be treated in a saddle-point approximation. Below we will carry 
out this saddle-point evaluation in full detail, but we can already quite easily understand the source of 
the linear dependence on k in our final result. The argument goes as follows: 

(i) Let us imagine carrying out all the Grassmann integrations in the problem. The remaining large- 
integrand will then have the form exp ( — AT + 0(\ogN)) where T might be termed the "effective 
large- A^ instanton action." The large- A^ saddle-points are then the stationary points of T with respect 
to the c-number collective coordinates. By Lorentz symmetry, T can only depend on the four k x k 
matrices a! n through even powers of a! n . (This is because the bulk of the insertions have been chosen 
to be at = 0; otherwise one could form the Lorentz scalar x n a' n and so have odd powers of a' n .) It 
follows that the ansatz 

c4 = 0, n = l,2,3,4, (7.48a) 
W c = 0, c= 1,2,3 (7.48b) 



53 As a nontrivial check on the Ward identity, we have also numerically integrated the large- N correlator for insertions 
other than Eq. (7.47), and verified the constancy of the answer presented below. 
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is automatically a stationary point of T with respect to these collective coordinates. (Note that (7.48b) 
follows automatically from (7.48a) by virtue of the ADHM constraints (6.3).) It will actually turn out 
that, once one assumes these saddle-point values, T is independent of the remaining collective coordinate 
matrix W°; furthermore we will verify that this saddle-point is actually a minimum of T. 

(ii) Having anticipated the saddle-point (7.48a)- (7.48b) using these elementary symmetry consid- 
erations, let us back up to a stage in the analysis prior to the Grassmann integration, and proceed a 
little more carefully. Evaluating the insertions tr^A 2 ^^) on this saddle-point, one easily verifies that 
the C modes vanish when = 0; consequently the ( integrations must be saturated entirely from the 
k 2 insertions at = x. This leaves the M', v and V integrations to be saturated purely from the 
insertions at = 0. Moreover, because M! carries a Weyl spinor index a whereas v and v do not, the 
tr/vA 2 (0) insertions depend on these Grassmann coordinates only through bi-linears of the form V x v 
or M' x M!\ there are no cross terms. 

(iii) Performing all the Grassmann integrations then automatically generates a combinatoric factor 

(ey.(ey.(kN-2ey. [ kN ~ 2 e ) . (7.49) 

Here the first three factors account for the indistinguishable bi-linear insertions of the (, A4', and {u, 9} 
modes, respectively, while the final factor counts the ways of selecting the k 2 bi-linears in M! from 
the kN — k 2 insertions at = 0. Multiplying these combinatoric factors together, as well as the 
normalization constants from then instanton measure, and taking the (kN) th root yields, in the large- ./V 
limit: 

[A k {C^) k (k 2 )\ {kN - k 2 )\\ 1/hN 2 3 n 2 eN~ 1 kA 3 + 0(^ 2 ) . (7.50) 

Remarkably, apart from a factor of four, this back-of-the-envelope analysis precisely accounts for the 
leading term in the 1/N in the final answer, Eq. (7.63) below. Note that most of the remaining 
contributions to the saddle-point analysis, which involve a specific convergent bosonic integral derived 
below, as well as the factor 2 3fc2 /Vol U(fc) from Eq. (6.29), reduce to unity when the (kN) th root is 
taken in the large- N limit; the missing factor of four will simply come from the leading saddle-point 
evaluation of the bosonic integrand. 

Here are the details of the large- N calculation of the /ciV-point correlation function. First, we find 
it convenient to partially fix the auxiliary U(k) symmetry by taking a basis where W° is diagonal: 

H/° = diag(2p 2 ,...,2p 2 ) . (7.51) 

As the notation implies, in the dilute instanton gas limit p« can be identified with the scale size of the 
2 th instanton in the /c-instanton sector (see §IV.2). The appropriate gauge fixing involves a Jacobian: 

i r n3fc(fc-l)/2_-fc poo k k 

votW / dt ' w ° - I n ^ n w - & . 

V 1 J >/U 1 = 1 !<J = 1 

For k — 1 one has, of course, J dW° — > 2 dp 2 . 
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and from Eq. (7.46), 



where 



Now let us consider the Grassmann integrations, beginning with the ( variables. We assume the 
saddle-point conditions (7.48a)- (7.48b), in which case 

A= (-C,J' < 7 - 53 > 

k 

tr N \ 2 (x) = J2tt«h(Ch F ij(x) + ••• , (7-54) 

Fij(x) = j-D TT - 2 , (7.55) 

Ag (x 2 + pt)(x 2 + p 2 ) 

and the terms omitted in Eq. (7.54) represent dependence on the other Grassmann collective coordinates 
{Ai', v, v\. It is obvious from Eq. (7.55) that i*V,(0) = 0, so that the ( must be entirely saturated from 
the k 2 insertions at = x as claimed above. Performing the ( integrations then yields 

k 

(A: 2 )! J] • (7-56) 

Next we consider the insertions at = 0. Focusing on the M 1 modes first, one finds from Eq. (7.46): 

k 

tr N \ 2 (0) = H £ (M' a UM' a h (pr 4 + Pj 4 + pfpf) + • • • , (7-57) 

9 i,j=l 



omitting the v x v terms. Hence the M! integrations yield 

2^_ (kN - k 2 

g k2 \ k 



( kN ~, e ) (k 2 )\ II (ft* + P7 4 + PfP?) > (7-58) 
^ ' i,j=i 

where the combinatoric factors in (7.58) (as well as in (7.56)) have been explained previously. 54 



Finally we turn to the {is, is} integrations. Since (unlike ( and Ai') the number of is and v variables 
grows with N as kN — 2k 2 , it does not suffice merely to plug in the saddle-point values (7.48a)-(7.48b) 
and (7.53). One must also calculate the Gaussian determinant about the saddle-point, which provides 
an O(N ) multiplicative contribution to the answer. Accordingly we expand about (7.48a)-(7.48b) to 
quadratic order in the a' n . From (7.46), the is x v contribution to tr/vA 2 (0) has the form 



~tr fc vvUj 

2g 



= -tr fc uisfb a Vb a f 
x =o g 



(7.59) 



x=0 

as follows from the differentiation formula (C.2b). Performing the {is, 9} integrations therefore gives 
(A;iV-2A; 2 )! exp \(N - 2k)tr k log (2fb a Vb a f) I ) 



I x=0 

k 



(kN -2k 2 )\ exp{(iV -2A;)(logdet fc 16(^ )- 2 - | X)(O -(Oii (ft 2 + Pj 2 ) + ■ 

(7.60) 



54 0ne can easily check that these large- N formulae are consistent with the explicit 1-instanton calculation presented 
in §VII.2.1 which is exact in N. 
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The negative sign in front of the quadratic term in a' n confirms that our saddle-point (7.48a)- (7.48b) 
is in fact a minimum of the action. Combining this expression with the measure factor in Eq. (6.29), 
namely 

\det 2k W\ N ~ 2k = exp((iV-2A;)logdet 2fc W0 = exp {(TV - 2k) ( logdet fc (|^ ) 2 + 0(a'J 4 )} , (7.61) 
and performing the Gaussian integrations over a' n , yields: 



A: 



2 



^--.(W-^n^^j + , (7.62) 
where the omitted terms are suppressed by powers of 1/N. 

Finally one combines Eqs. (6.29), (3.23), (7.56), (7.58) and (7.62) (and the definition of the A- 
parameter in (5.17)) to obtain the leading-order result for the correlator: 



(7.63) 



^A; 



(^tr.vA V 1} ) x • • • x ^tr^AV*")) 

at-.oo 2 kN+k2 ~ k+1/2 n- k+1 / 2 e kN (k 2 )\ k kN ~ k2 + 1 / 2 \ 3kN 

> 32A: 2 J\fkN+k 2 -l/2 fc\ 

where is the convergent integral 

% = U d PiU \P" - Pi ■ II ^(x) (1 - {pi/* + Pi /p 3 )- 2 ) . (7.64) 

^° 1=1 i,j 1,7=1 



Note that "Jk is independent of x as a simple re-scaling argument confirms. For the simple case k — 1, 
Ji = § and the expression (7.63) agrees — as it must — with the large- N limit of the one-instanton result 
in Eq. (7.21). 



VII. 3. 2 Weak coupling 



In this section, we turn to multi-instanton effects in the WCI approach. We apply the same large-iV 
saddle-point methods to extract the large- iV behaviour of the /c-instanton contribution to the Appoint 
of ^f^-tr^A 2 . In order to make the calculation more manageable, we will choose the VEVs from the 
outset to be v 1 — • • • — i>jv-i — Vi — • • • — Cat-i = v. In this case, the bosonic part of the instanton 
effective action (6.55) can be written as 

S h = 7rVtr 2fc (^ - £W£) . (7.65) 

Here, W and £ are the 2k x 2k matrix defined in (6.1) and (2.83), respectively, and the complex 2k 
vector Uia is defined in terms of the gauge orientation via 

u = (U 1N , . . .,U 2 k,N) ■ (7.66) 

Notice that the correlation function only depends on the gauge orientation through the vector u in the 
instanton effective action (7.65). So we need an expression for the reduced measure J dU = J d 2k u f(u). 
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We now pause to evaluate the function f(u). The complex 2k- vector u is composed of the first 2k 
components of the complex unit iV-vector U = (Uuv, . . . , Unn)- Let u = (1i2k+i,N, . . . , Unn) be the 
complementary complex (N — 2 A;) -vector: U = u + u. It is clear that the measure on the unit vector U 
inherited from the group measure J dU is proportional to 

J d 2N U5(\U\ 2 - 1) = J d 4k ud^ N - 2k ^u5(\u\ 2 + \u\ 2 - 1) . (7.67) 

The measure on u is obtained by integrating over u giving (up to constant) 

^(l-lul 2 )^" 2 *- 1 . (7.68) 



/ ■ 

J\u\<l 



Actually (7.68) is still too general for our needs. It will be sufficient for our purposes to know the 
measure on the k quantities Zi = ufu i6l (with no sum on i). The measure on the {^} follows easily 
from (7.68). Finally taking into account the normalization (6.17), we have the required expression for 
the reduced measure of the gauge orientation: 

/ - 2nS) jf { n** «} (1 - 1 -r*- 1 . «m 

with the constraint Yli=i z i — 1- 

We now have all the ingredients needed to start a large- N analysis. In this limit, we will evaluate 
the collective coordinate integral by a method of steepest descent. As usual in such a saddle-point 
analysis, one exponentiates any terms in the collective coordinate integral raised to the power N, and 
includes them, along with the bosonic terms in the instanton effective action, in a large-iV effective 
action. There are two contributions besides the instanton effective action (7.65). The first, is the factor 
of |det 2 fcW / | iV in the measure (6.29) and the second the factor of (1 — in the measure for the gauge 

orientation (7.68). Placing all these contributions up in exponent the defines a saddle-point action: 

T = -iVlogdet 2fc (|Vy l [2]x[2] - 2a' m a' n a mn ) + ir 2 v 2 tr 2k (W - £««0 - iVlog(l - M 2 ) . (7.70) 

Now we find the extrema of T with respect to the independent variables a' n , W° and «„. 

Rather than write down the saddle-point equations and proceed to solve them, we will simply write 
down an ansatz for the solution and then show ex post facto from a fluctuation analysis that it is indeed 
an extremum. Up to the auxiliary U(k) symmetry, our ansatz is 

W ° = J-^MxM > ( 7 ' 71a ) 

a' n = -dmg(Xl . . . , X k n ) , (7.71b) 

Uia — . (7.71C) 

These values imply 



N N 

^^l[2 fc ]x[2 fc ] , W = 7^2-l[2 fe ]x[2 fc ] • (7.72) 
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The solution has a simple physical interpretation. It represents k instantons in the dilute-gas limit 
with positions X l n , all with the same scale size inhabiting k mutually commuting SU(2) 

subgroups of the gauge group 55 orthogonal to the gauge orientation of the VEV (since Uia = 0). The 
fact that the instantons live in commuting subgroups of the gauge group implies that they are non- 
interacting and the solution is dilute-gas-like. 

We now turn to an analysis of the fluctuations around the solution (7.71a)-(7.71c). The explicit 
expression for the expansion of action around the saddle-point to quadratic order is 



= 2kN(l + log( 27 rV/iV)) + ^£ J2 S < SW ; 



4j\T v n 

1,7=1 

(7-73) 



i,3=l i=l 



where we have removed the cross terms between fluctuations involving 5W° with u and a' n with u by 
appropriate shifts in 5W° and a' n . At this point, we find it convenient to fix the U(fc) symmetry by 
making the "gauge choice" (2.100) for the fluctuations a! n and then denote the remaining fluctuations 



To leading order in the 1/N expansion, we substitute the saddle-point expressions for the vari- 
ables (7.71a)-(7.71b) into the insertions. Due to the dilute-gas nature of the saddle-point solution, at 
leading order, each insertion Yg^tr^A 2 will be a sum over k one-instanton expressions involving the 
one-instanton collective coordinates {pi, X n , u^, -M'u, Pui, fiui} for a fixed i G {1, . . . , k}. In particular, 
the only dependence on the Grassmann collective coordinates M.' u is through the insertions because at 
leading order they decouple from the fermionic ADHM constraints. Therefore using the one-instanton 
expression (7.15), we can replace each insertion with 



^ ((x-X^ + N/^v 2 ))' 



— tr,vA 2 (*) — J2~ — n . 77± n —, (M'«UM' a ) u . (7.74) 



Even at this intermediate stage in the calculation we can draw an important conclusion. Since, to 
leading order, the insertions do not involve any "off-diagonal" collective coordinates which communicate 
between the individual instantons of the saddle-point solution, the result for the correlation function is 
destined to cluster correctly. Nevertheless, it is interesting to show how this happens explicitly. 

As we have noted, the leading order expression for the insertions are independent of the fluctuations 



55 The generators of the SU(2) cmbcddings of the k instantons in the dilute limit are given in (2.103). The fact that 
the commutators of these generators for different instantons vanishes follows from the fact that the solution above has 
W c = 0. 
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SW^ and a' n ; hence, we can immediately integrate them out: 



We have succeeded in integrating out all the off-diagonal c-number collective coordinates. Now for 
the Grassmann sector. The off-diagonal elements of M.' a do not appear in the instanton effective action 
(6.55) and, at leading order in 1/N, in the insertions (7.74); hence, the integrals over these quantities 
must be saturated by the fermionic ADHM constraints. Since at the saddle point a' n is diagonal, the 
integrals are 

k 2 k 

I n n - x^m'm } = n - . ^ 

i¥=3 i^j 

This conveniently cancels against similar factors in (3.23) and (7.75). 

The integrals that remain are over the positions of the instantons X x n , the gauge orientation co- 
ordinates Zi and the Grassmann collective coordinates {[i iu , p, iu , (M.' a )n, 1C, JC}. The integrals over 
the remaining Grassmann collective coordinates proceeds as for the exact one-instanton calculation 
in §VII.2.2: the {JC, IC} and {f^ U i,fiiu} integrals are saturated by bringing down terms from the instan- 
ton effective action (6.55), as in (7.28) for each %. The integrals over the remaining variables {f^Ni,fiiN} 
are then saturated by the diagonal fermionic ADHM constraints, as in (7.29), for each i. 

Putting all the pieces together, we have the leading order 1/N contribution 



/ „ „ \ », qfc..fc(2JV+l) n -2kN e 2mkr ,,-2kN /u3fc/2 



X 



V TT<! / d^X 1 - r ' dz-z 2 e- Nz > /4 

^ 11 ]l {(x^)-X*) 2 + N/(2n 2 v 2 )Y Jo 



perms 
{i i }of{l,...,fc} 



(7.77) 



Performing the remaining integrals 

(^tr N \ 2 (xU) x -.. x ^trjvAV*))) ^ [n 2N+1 g 2N e^v'^)" (7.78) 

After renormalizing the VEV and matching to the pure gauge theory in the way described the end of 
§VII.2.2, our leading-order large- N result is 

(^trjvAV 1 )) x ... x ^tr^A 2 ^)} ^ A 3fc . (7.79) 
As noted above above, clustering is satisfied at large N. 
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VII. 4 Clustering in instanton calculations 



Taking the SCI calculations in §VII.2.1 and §VII.3.1, compare the kN th -mot of (7.63) with the large- iV 
limit of the iV th -root of the one-instanton result (7.21): 

^A 3 + <D(N~ 2 ) verses ^A 3 + 0(N~ 2 ) . (7.80) 

The implication is that the clustering property of the field theory is not respected by approximating 
the functional integral with an integral over instantons in the confining phase. In Ref. [67], it was 
further shown that the existence of the Shifman-Kovner vacuum designed to reconcile the SCI and 
WCI at the one-instanton level, cannot account for the mismatch (7.80) for general k. We interpret 
the result to mean that instantons in the confining phase by themselves cannot account for all of the 
gluino condensate and that additional non-perturbative configurations must necessarily contribute. This 
conclusion is also supported by numerical calculations of the two-instanton contribution to four-point 
function for gauge group SU(2) which were performed in [67]. 

In contrast the WCI approach is consistent with clustering, at least at large N: compare (7.45) with 
(7.79). But we can argue further that consistency with clustering is guaranteed in the WCI approach, 
even at finite N. The argument proceeds as follows. Suppose one calculated the /c-instanton contribution 
to the /c-point function in the regime where \x^ — ^> g/v, where v is the characteristic scale of 
the VEVs. In other words, the separation between the insertions is much greater than the effective 
cut-off on the instanton scale size imposed by the instanton effective action (6.55). Since each insertion 
can saturate the integrals over two Grassmann collective coordinates, the dominant contribution to the 
integral comes from a region where one of the k instantons lies in the vicinity of each of the k insertion 
points. Choosing the insertion points in this way selects a region of 971^ in which the k instanton 
configuration is completely clustered, as described in §11.4.3. So to leading order we can ignore any 
interactions between them. The collective coordinate integration measure then clusters as in (3.25). 
The factor of l/k\ is then cancelled by the k\ ways of pairing the k instantons and k insertion points. 
Hence, to leading order, for these large separations we have 

(^tr^AV 1 )) x ... x £,tr N \*(xW)) = (if^A 2 )* . (7.81) 

\ / fc-mst \ / 1-mst 

But we have proved in §VII.l that the correlation functions are independent of the insertion points and 
so (7.81) is an exact statement. 

Now that we have established that the SCI approach is inconsistent with clustering it is appropriate 
for us to make some comments regarding this methodology. The usual justification put forward for this 
method can be paraphrased as follows [10,38,63]. The correlation functions of trjyA 2 are independent 
of the insertion points and so we can consider the configuration where the insertions are very close 
compared with the scale of strong coupling effects A. In this case only small instanton configurations 
should contribute to the integral over the instanton moduli space. If this is true then we should be 
able in invoke asymptotic freedom, namely the fact that the integral over the instanton moduli space 
involves the the instanton action factor exp(— 87i 2 /g 2 ), where g runs with the characteristic scale size 
of the instanton, to argue that the calculation is reliable for insertions which are arbitrarily close. 



116 



The fact that the correlation functions are independent of the insertions then allows us to continue 
to large separations. Unfortunately this argument is potentially flawed because, as in is clear from 
(7.21), the integral over the instanton scale size is not cut-off exponentially by the separations between 
the insertions but only by a power law and so large instantons are not adequately suppressed. This 
behaviour should at least question the use of asymptotic freedom as an argument for exactness. 

Although we have only chosen to discuss the gluino condensate the analysis can be generalized to 
any correlation function involving gauge-invariant lowest-components of chiral superfields. For example, 
in the theories with fundamental hypermultiplets, the gauge invariant composite q/qf is also gauge 
invariant and the lowest component of a chiral superfield. So correlation functions of this insertion, 
along with g 2 trArA 2 will, by the Ward identity, be independent of the insertion points and holomorphic 
in the couplings. The SCI verses WCI mismatch that we have witnessed for the gluino condensate 
extends to these correlation functions and the same conclusion applies: only the WCI approaches are 
reliable. The mismatch is particularly striking in the theory with Np = N — 1 hypermultiplets. In this 
case the correlation function 

Q = (^tr N \*(xU)q fl q f ,(xW) X • • • X q^q^Jx™)) (7.82) 

receives a contribution from a single instanton in both the SCI and WCI approaches. In the SCI 
methodology one takes the Higgs to be zero in the bare theory so that the instantons are not constrained. 
VEVs for the scalar fields then arise from instanton effects. On the contrary, in the WCI methodology 
the Higgs VEV is included in the bare theory and the instantons are constrained. The SCI approach 
yields the result [10] 

£sci = ^p e {/ ; },{/;} , (7-83) 

where £{/,},{//} equals 1 if all the f t are distinct and {//} = {//}, otherwise it is 0. The WCI result is 
very easy to establish given the calculation of (-^pi^ 2 ) in §VII.2.2. This is because the insertions of 
the matter fields can be replaced by their VEVs q^q^i — > VfVfSff. Multiplying (7.34) by the product 
of VEVs one finds 

N-l 

Swci = a™+; n • ( 7 - 84 ) 

1=1 

What makes the mismatch between (7.83) and (7.84) particularly striking is that the integral over the 
collective coordinates that yields the WCI result is formally just the limit of the SCI integral with the 
VEVs set to zero: Vf = Vf = as emphasized in the context of gauge group SU(2) in Ref. [76]. So the 
collective coordinate integral is discontinuous at this point. These considerations show that the SCI 
approach yields an incorrect result even when applied in a weakly-coupled Higgs phase. 
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VIII On the Coulomb Branch of J\f = 2 Gauge Theories 



Theories with Af = 2 supersymmetry have an adjoint- valued scalar field and consequently a Coulomb 
branch where the gauge symmetry is broken to the maximal abelian subgroup. For large values of the 
VEVs the theory is weakly-coupled and constrained instanton methods should be reliable. Moreover 
certain holomorphic quantities, and in particular the prepotential described below, are protected against 
quantum corrections (beyond one-loop) and leading-order semi-classical methods should be exact. On 
the Coulomb branch of an M = 2 theory there is a completely different approach to calculating the 
prepotential based on the theory of Seiberg and Witten [68,77]. This remarkable theory predicts in 
a rather implicit way the exact form of the prepotential which can then be expanded in the semi- 
classical regime yielding the sum of a tree-level and one-loop contribution plus a series of instanton 
terms of arbitrary charge. In general closed-form expressions for higher instanton numbers are not 
known, however what has been established are recursion relations which give the higher instanton 
coefficients, in terms of those of lower instanton number. The situation gives us an unprecedented 
laboratory for testing semi-classical instanton methods and all that entails: imaginary-time formalism, 
Euclidean path integral, etc. Actually the result cuts both ways: not only can we quantitatively test 
semi-classical instanton methods in gauge theory, but, in addition, we can subject the ingenious theory 
of Seiberg Witten to stringent tests. 56 

After introducing the notion of the low-energy effective action and the prepotential, we then go on 
to show how the instanton contributions at each instanton number are the centred instanton partition 
function introduced in §VI.5. We then go on to describe the explicit evaluation of this partition function 
for the particular case of gauge group SU(2) with Np flavours of hypermultiplets in the fundamental 
representation for instanton number k = 1 and k = 2. This calculation relies on the fact that the gauge 
group SU(2) is also Sp(l) and the ADHM construction based on Sp(iV), for TV = 1, is more economical 
than that for SU(iV), for N = 2. We then present the one-instanton calculation for general unitary 
groups. The results we obtain are in precise agreement with Seiberg- Witten theory for any number of 
flavours. Nevertheless, the cases with N F = 2N are rather special since there are non-trivial dictionary 
issues to resolve. 

We shall conclude this overture with a brief guide to literature which studied instanton effects in 
Seiberg- Witten models. First instanton tests of pure M = 2 supersymmetric SU(2) theories were per- 
formed in [54] at a one-instanton level and in [25] at a two-instanton level. Two-instanton contributions 
to the prepotential in SU(2) theories with Np fundamental hypermultiplets were calculated in [48,61,78] 
and the general expression for the /c-instanton contribution to the prepotential as an integral over the 
ADHM moduli space was derived in [48]. The relation of Matone [80] between the prepotential and the 
condensate «2 in SU(2) was tested at a two-instanton level in [81] and derived to all orders in instan- 
tons in [82]. SU(iV) gauge theories with and without matter fields were studied at one-instanton level 
in [20,83,84]. In all these cases exact results of Seiberg- Witten and their generalizations [85-90] for the 
low-energy effective action were reproduced exactly for N F < 2N. The case of N F = 2N was considered 

56 We take it as unreasonably perverse that the instanton caleulus and Seiberg- Witten theory could both be wrong 
while being consistent with one another. In any case there are other tests of Seiberg- Witten theory based, for example, 
on the spectrum of BPS states [79]. 
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in [20,91] where it was pointed out that the mismatch between instanton calculations and the proposed 
exact solutions for Np = 2N arises due to a finite (perturbative and non-perturbative) renormalization 
of the coupling constant of the low-energy effective theory. This effect has to be incorporated into 
exact solutions which then agree with instanton results. For a recent careful treatment of these issues 
see [92]. In addition to these effects, explicit instanton calculations are also necessary in order to fix the 
dictionary between the quantum moduli used in constructing exact solutions and the gauge-invariant 
VEVs u n , see [20,58,61,84,91-94] for more detail. A completely new technique for evaluating the in- 
stanton contributions to the prepotential has been pioneered in [95,96] leading to the first calculations 
of instanton effects for all instanton number (beyond the large- N calculations reported in Chapters VII 
and IX) and the first test of Seiberg-Witten theory to all orders in the instanton expansion. We will 
briefly describe this new formalism in §XI.2. 



VIII. 1 Seiberg-Witten theory and the prepotential 

Theories with Af = 2 supersymmetry and gauge group SU(iV) on the Coulomb branch have an N — 1 
complex dimensional vacuum moduli space parameterized by the VEVs (4.63) for the complex scalar 
field 0. At a generic point on this classical moduli space the gauge group is broken to its maximal 
abelian subgroup U(l) Ar_1 C SU(iV) and the theory is in a Coulomb phase. In these circumstances it is 
possible to describe the long-distance behaviour of the theory in terms of a low-energy effective action. 
This can be written in terms of abelian Af = 1 superfields 57 W au = (A mu , X u ) and = (0 n , ip u ): 



>eff 



i- J d i xl^{\r uv ^)W:W av \ e2 + ^ u ^) *t[ 2 - 2 } • (8-1) 



where the VEVs of the scalar components of $ u are identified with g^Pj \/2 of the field of the microscopic 
theory. This low-energy effective action is uniquely determined by the holomorphic prepotential J-{Q) 
[68] as 

dT d 2 T 

Equation (8.1) implies that the matrix of complexified coupling constants of the low-energy U(l) iV_1 
theory has components t uv that depend on the scalar VEVs. 

The electric-magnetic duality of the U(l) iV ~ 1 theory, uncovered by Seiberg and Witten [68,77], 
identifies $d« with the magnetic dual of the original (electric) matter superfield $ M . The vacuum 
expectation values of the scalar component of <3>d« are g<t>vJ V^j with J2u=i ^dm = 0? which provide an 
alternative parameterization of the vacuum moduli space. Both parameterizations, {0°} and {05 u }, 
are not valid globally on the moduli space. Such a global parameterization is provided [68,77] by the 
gauge-invariant condensates {«„} n — 2, . . . , N: 

u n = 2- n / 2 g n (ti N (f> n ) ■ (8-3) 

On a patch of the moduli space where {0°} are a local coordinate system we have 0° = <fi Q u (u n ) while on 
the patch where the dual variables {0d u } are a local coordinate system we have 0^ n = 4>Bu( u n)- The 
57 For each of these superfields we have the traceless condition ^2^=1 Wau — X^=i = 0- 
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analysis of Seiberg and Witten [68,77], and its generalizations, explicitly determine these functions in 
terms of an auxiliary Riemann surface defined by a family of hyper-elliptic curves. 



The effective action specifies the long-range behaviour, compared with the scale l/(g<f)°), of a series 
of correlation functions. In particular, we will focus on a four-point anti-chiral fermion correlator which 
involves the fourth derivative of the prepotential. In Euclidean space 

<At(x (1) )Af 2 (x( 2 ))<(^ (3) )^(x( 4 ))) 

1 f d 4 X^ a (x^\X)S%(x^ 2 \X)S^(x^,X)s\(x^,X) , (8 ' 4) 



2m d^d^d^l 
where S(x, X) is the free anti-Weyl spinor propagator 



The restrictions imposed by holomorphy, renormalization group invariance and the anomaly imply 
that the prepotential has a weak coupling expansion consisting of perturbative piece that is one-loop 
exact and a sum over instanton contributions which are exact to leading order in the semi-classical 
expansion. For SU(iV) with Np < 2N fundamental hypermultiplets 

1 oo 

^ = ^ert + ^-J2^ Ak(2N ~ NF) ■ ( 8 - 6 ) 
k=l 

In the special case Np = 2N, for which the /9-function vanishes, 

N oo 
u=l k=l 

In general, the coefficients Tk depend on the VEVs and hyper mult iplet masses. 

The theory of Seiberg and Witten, extended from the original setting for SU(2) [68], to SU(iV), 
N > 2, in Refs. [85-89], determines the exact form of the prepotential and in particular the /c-instanton 
coefficients jF fc . The construction involves an auxiliary algebraic curve, or Riemann surface, the Seiberg- 
Witten curve}* Once the curve has been identified, the coefficients can, in principle, be extracted. 
In particular, explicit expression for low instanton number have been found in this way [83,90]. For 
arbitrary instanton number, explicit expressions are hard to obtain, however, recursion relations have 
been established that relate instanton contributions at charge k to those of lower charge [80,97-100]. 
These recursion relations are only valid in the asymptotically-free theories with Np < 2N. The finite 
theory with Np = 2N is rather special since there are various re-definitions of the physical quantities 
that have to be taken into account: see Ref. [92]. 

58 In a nut-shell, the matrix of coupling t uv is identified with the period matrix of the curve in a suitable basis. 
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Here, we use the notation of [90] to write down the k — 1 and k = 2 contributions for N F < 2N: 



N 



u=l 

N „ / ,ns ^ / ,ns \' o2 c f J.O'i 



^ 2 - a 3 £r A) w • (8 ' 8b) 



where we have used the function 

N 1 N F 



^^n(^nK+^ ( 8 - 9 ) 



VIII. 2 Extracting the prepotential from instantons 



There are two distinct — but ultimately equivalent — ways of determining the instanton contributions 
to the prepotential. The first involves calculating the leading semi-classical contribution to Green's 
functions whose long-distance limit can be matched with the low-energy effective action (8.1). Here, 
we will concentrate on the four-point anti-chiral fermion correlator (8.4) which involves the fourth 
derivative of the prepotential. However, other correlators which we do not consider explicitly determine 
the second derivative of the prepotential in an analogous way. The second approach involves calculating 
the instanton contribution to the condensate defined in (8.3) and then relating this to the prepotential 
by use of a renormalization group equation which yields the derivative of the prepotential with respect 
to log A (for N F < 2N) [80,101-103]. Alternatively, we will take the view that the instanton calculation 
of the condensate provides an independent way to establish the renormalization group relation. 

For the first approach we turn our focus to the instanton contributions to the four-point anti-chiral 
fermion correlator (8.4). The first point to make is that since there are non-trivial scalar VEVs the 
instantons are constrained as described in §IV.4.1. It will transpire that the integral over the 4- vector 
X in (8.4) arises in the instanton context from the integral over the centre of the instanton (2.82). 
Consequently, in order to extract the long-distance behaviour of the correlator we should insert the 
corresponding behaviour of the anti-chiral fermions A^ = (A, ip) far from the core of the constrained 
instanton. Recall from our discussion in §IV.4.1, that in the tail of the constrained instanton fields decay 
exponentially except for the components which remain massless after the Higgs mechanism. These are 
the fields valued in the U^l)-^ -1 subgroup picked out by the VEV 0°; in other words the diagonal 
elements of the N x N matrices A^. It is precisely these components of the anti-chiral fermions that 
we need in order calculate the long-distance behaviour of the correlator. This is fortunate because, as 
we argued in §IV.4.1, to leading order in the semi-classical expansion these massless components are 
simply given by their ADHM expressions. Moreover, we will only need the components of the anti- 
chiral fermions that depend on the four supersymmetric Grassmann collective coordinates £ A (4.43), 
since these are the only Grassmann variables whose integrals are not saturated by the instanton effective 
action (6.84). It will turn out that each insertion of A^ is then linear in £ B and therefore the insertion 
of the four anti-chiral fermions precisely saturates the integrals over the four Grassmann variables 
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associated to the unlifted supersymmetric zero modes. In principle, we can extract the terms that we 
need from the solution for the anti-chiral fermions in (4.62) (with ipA given in (C.38) and (C.39a)- 
(C.39c)). However, it more straightforward to use the sweeping-out procedure described in §IV.3. The 
supersymmetry transformation (4.23c) (re-scaled by g 1 ^ 2 ) in the super-instanton background gives the 
required dependence on t; A : 

X A = -ig^aAB f<p a i B = -g 1 ' 2 WU , (8.10) 

where in the latter expression we used the relations (4.20). The expression for the anti-holomorphic 
component of the scalar field is given in (6.66) (it is important to remember that in an instanton 
background $ is not the conjugate of 0). We then expand the diagonal components of Ipft (these are 
the massless components) for large distances from the instanton core using the asymptotic formulae in 
§11.4.3: 

If / Nf 1 

{W)uu = - {x _ x)2 w u ^° u h lk]x[k] + L-^wfijFwfi -iJ^JC f K f ) |< . (8.11) 

Plugging this into (8.10), and using the form of the instanton effective action (6.84), we are able to 
establish the following simple relation between the long-distance behaviour of the anti-chiral fermion 
and the instanton effective action: 59 

\° u = 2^gS™(x,X)e AB ^-^ + --- ■ (8.12) 

The ellipsis represent terms depending on other Grassmann collective coordinate which will not be 
required to determine the long-range behaviour of the correlator in question. 

The /c-instanton contribution to the correlation function (8.4) is evaluated in the usual way by 
making insertions of the long-distance component (8.12) into the collective coordinate integral: 



J Mi. 



k(2N-N F ) , (8.13) 

/' \ „2mkr ' 



e 

Here, oj^ n=2 ' n f) i s the supersymmetric volume form on the instanton moduli space in a theory with N F 
hypermultiplets defined in (5.55) and (6.59). The instanton effective action S is precisely (6.84) along 
with the hypermultiplet mass term (6.87). Following [48,82], we substitute the expression (8.12) into 
the right-hand side and perform the integrals over the four supersymmetric Grassmann variables ^ A . 
This leaves 

I k 

zV(- e 2 ^ _/__ / u «^ e - s (8.14) 

x I d 4 ! r t^ 1 ), i)s^(x (2) , i)^(x (3) , i)sV^ (4 U) ■ 



59 



Here, \ u = (A) uu the diagonal elements of A. 
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The integral over the centred moduli space 9Jlk is precisely the centred instanton partition function 
g(M=2,N F ) ^ e £ nec } j n (6103). Note in (8.14) the linearity of § in 0°, apparent in (6.84), has been used 
to pull the 0° derivatives outside the collective coordinate integral. 

Comparing the semi-classical expression (8.14) with its exact counterpart (8.4), we deduce the 
following expression for the /c-instanton expansion coefficient of the prepotential in (8.6): 

One might have thought that we could add to this relation any function whose fourth derivative with 
respect to the VEVs vanishes. Actually, by considering other correlators, one can establish (8.15) up 
to functions whose second derivative with respect to the VEVs vanishes. However, since there are no 
possible linear functions of the VEVs, (8.15) must be true up to an undetermined constant which does 
not affect the physics because the low-energy effective action only depends on derivatives of T . Note 
that in order to write the instanton contributions in the form (8.6) we have substituted the expression 
for the A-parameter of the theory in terms of the running coupling: 

A™- n f = /i 27V -^e 2 ^ . (8.16) 

For the special case when Nf = 2N, where the /9-function vanishes, the factor A 2 ^^ should be 
replaced by q = e 2l7rT . 

The other way to calculate the instanton expansion of the prepotential is to use the renormalization 
group equation for the prepotential first established by Matone in the context of pure SU(2) theory [80] 
and then generalized to SU(AT) in Refs. [101-103]. Rather than write down the relevant equation we 
shall proceed to prove a version of the renormalization group equation using the instanton calculus, 
generalizing the SU(2) calculation of [82]. One may then check that our instanton version is consistent 
with those in the literature cited above. Our relation will also be valid in the finite theory where 
Np = 2N. The approach is rather different from that just follows. Instead of calculating the instanton 
contribution to the long-range behaviour of a correlator, one calculates the instanton contributions to 
the condensate «2- 

To begin, we establish the form of the insertion tr^ 2 in the constrained instanton background. 
Just as in the calculation of the gluino condensate in the M = 1 theories in the Higgs phase described 
in §VII.2.2 and §VII.3.2, to leading order we may replace the insertion by its value in the unconstrained 
ADHM instanton background. This is because any error incurred is necessarily of a higher order in 
g. Just as for the 4-point anti-chiral fermion correlator (8.4), the insertion must saturate the integrals 
over the four supersymmetric Grassmann variables £ A . Since, as is evident from (4.64), the scalar 
field is quadratic in the Grassmann collective coordinates so the composite tr^0 2 is indeed quartic in 
the Grassmann collective coordinates. The dependence on £ A may be obtained by the sweeping out 
procedure using the supersymmetry transformation (4.23d): 

0a = -\g^aABi A 0-mni B F mn H . (8-17) 

Therefore, using (4.20), = 0i — i0 2 , and the identity 

i<y mn ri £o-p q v = -l(£Q{rm) , ( 8 - 18 ) 
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for arbitrary spinors £ and rj, we have 



yhr N <p\ X ) = y\ee)(ee)tr N + ■■■ . (8.19) 

We now insert (8.19) into the collective coordinate integral. Separating out the variables {X, £ A } as in 
(6.102) one can then trivially integrate over the Grassmann variables £ A . The integral over the 4- vector 
X is also trivial because by translational symmetry it may be traded for an integral over the insertion 
point x yielding f d A x ti NF^ n (x) which is the integral that gives the instanton charge (2.2). Hence, the 
fc-instanton contribution to the condensate is 

u 2 \ k = kAf™- NF) g- k{2N - N ^ +2 Z ( f =w . (8.20) 

Notice that the instanton contribution to the condensate u 2 is proportional to the centred instanton 
partition function just as for the expression for the gluino condensate (7.25) in the M = 1 theory. Of 
course in the M = 2 theory the condensate receives contributions from all instanton numbers whereas 
one can easily show that in the M = 1 theory the centred instanton partition function vanishes for 
k > 1. 

Comparing (8.20) with our formula for the prepotential (8.24) we obtain a version of the renor- 
malization group equation in the form of Ref. [82]. The result is most easy to state in terms of the 
/c-instanton contribution: 

u 2 \ k = k\ k{2N ~ NF) T k . (8.21) 

This expression relates only the non-perturbative contributions to both quantities. The rest, however, 
is easy to determine via direct perturbative calculation. In addition to instantons, the prepotential 
receives only 1-loop perturbative contributions, while u 2 receives no perturbative corrections to its 
classical value. It is remarkable that we were able to derive (8.21) using instanton calculus, but without 
actually having to integrate over the instanton moduli space. Note that the result is equally valid in 
the finite theory N F = 2N on replacing A. 2N - n f _^ e ^ir anc j ^ j g a j go f or arbitrary hypermultiplet 
masses. 

At this point we find it useful to perform various re-scalings of the variables by powers of the coupling 
g in order that there are is no explicit g dependence in Z k - The required re-scalings are 

<f>° ^ g-Y , a^^gaa, M A - g 1/2 M A , K - g l/2 lC , K^g^it. (8.22) 

Under these re-scalings, the centred instanton partition function scales as 

g(AT=2,AT F ) _^ ^fc(2Af-Af F )+2£(A^=2,JV F ) 

In particular there is no g dependence in the re-scaled instanton effective action action and the re-scaled 
version of (8.15) is simply 

J= k = z { f= w , (8.24) 
where all g dependence has disappeared (as it should). 
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VIII.3 Gauge group SU(2) 



In this section, we specialize to gauge group SU(2) and compute the one and two instanton contributions 
to the prepotential in order to compare with the exact theory of Seiberg and Witten. For SU(2), there 
is a single VEV 0° = 05/2 = —02/2. The predictions for the one- and two-instanton coefficients are 
obtained from (8.8a) by setting N = 2. 

Since the gauge group SU(2) is isomorphic to Sp(l) one can proceed with either of the two formalisms; 
however, as described in §VI.2, the ADHM construction for Sp(l) is more economical in the sense that 
for a given k their are fewer ADHM variables and constraints compared with the SU(2) formalism. 
Hence, the Sp(l) formalism is better suited for a direct calculation of the prepotential. 

Almost all of the formulae that we established in the SU(iV) instanton calculus case carry over to 
Sp(l) by simply imposing the extra restrictions (6.32) and (6.33) on a A and M A , respectively. Recall, 
once we have replaced the gauge index it = 1,2 by a spinor index a = 1,2, the ADHM variables Waui 
become quaternions w a ai- From (6.36) 

' ' w 



(8.25) 

a 1 

where w is a A;-vector of quaternions and a' is a k x k symmetric matrix whose elements are quaternions. 
In (8.25) and in much of the following the quaternion indices are understood. Products of quaternions 
are defined as 2 x 2 matrix multiplication. We also define the quaternion inner product 

x ■ y = \tr 2 (xy + yx) , \x\ 2 = x ■ x = x n x n . (8.26) 

For the Grassmann collective coordinates 

<a _ I ^ 



M A = ^ lA j , (8.27) 

for A = 1,2, where /i A and M.' A are Weyl spinor-valued /c-vectors and k x k symmetric matrices, 
respectively. 

The collective coordinate integral will involve the volume form on the instanton moduli space which 
is given by an obvious translation of the general SU(iV) formula (5.55): 



r- WAf=2) r. k 2 k k 

L = 4i / n ^ rF«« { n rF«* n 

J Mh \ ) J ,v-o 4—1 i— 1 oVo — 1 



K ' J »=1 i<3 A=l i=l i<j=l , . 

k , 3 2 2 - {<■■-<■) 

x |detL|" 1 J] | HS^tikT^^a^ij - (a^a^)) J] J] 5((M A a^ - {M A a*) 3l ) 

i<j=l ^ c=l A=l a=l 

In this expression, integrals over a quaternion w are defined as f d 4 w = J dwi dw 2 dw 3 dw^. Note in 
the one-instanton sector the 5-functions are absent (unlike the SU(2) formalism). In this formalism L 
is now a linear operator on the space of k x k antisymmetric matrices but still defined as in (2.125). 
Finally, the normalization constant is fixed by using clustering and comparison with the one-instanton 
collective coordinate integral in [2]: 

C (M=2) = 2 5k-k^ n ~4k _ ( 8 29 j 



125 



The leading order expression for the instanton effective action of the M = 2 theory with N F funda- 
mental hypermultiplets can be read of the expression in the SU(iV) theory (6.84). In addition, we add 
the mass terms (6.87). The collective coordinate integral also includes integrals over the hypermultiplet 
collective coordinates {/C, /C} in (6.58). Consider the complete expression for the collective coordinate 
integral in the chiral limit, rrif — 0. In this limit, for fixed flavor index /, the Grassmann measure (6.58) 
is obviously even or odd under the discrete symmetry 

Kif <-> JCfi , (8.30) 

depending on whether k itself is even or odd. On the other hand, the instanton effective action (6.84) 
(without the mass terms) is always even under this symmetry. Therefore, for Np > 0, only the even- 
instanton sectors k — 0, 2, . . . can contribute in the chiral limit 60 (recall that when N F = all instanton 
numbers contribute). This selection rule was already noted by Seiberg and Witten in Sec. 3 of [77] so it 
is satisfying to see it arising naturally in the instanton calculus. Of course it is violated once the masses 
are non-zero, since JCifJCfi is odd under the symmetry. 



VIII. 3.1 One instanton 



Recall that in order to calculate instanton contributions to the prepotential we have to calculate the 
centred instanton partition function (8.24). We begin with the one-instanton case. In ADHM language, 
the bosonic and fermionic parameters of a single M = 2 super-instanton are contained in three 2x1 
matrices of unconstrained parameters: 

-*)■ ^ = (-£")' (8 ' 31) 

In addition, there are 2Np Grassmann variables {JCf, JCf} which parameterize the fundamental zero 
modes. The centred-instanton volume form is extracted from (8.28): 

r 2 3 c 2 Np 

i = / ] i n dic f dt f > ( s - 32 ) 

Jm ± 71 J A=l f=l 

where the instanton effective action for k — 1 is easily deduce from (6.84) and (6.87): 

Np 

S = 8tt 2 M 2 |0T - 2m 2 n + tt 2 m f^f^f ■ ( 8 - 33 ) 

/=i 

Notice that the only dependence on JCf and }Cf comes exclusively from the mass term at the one- 
instanton level. The corresponding Grassmann integrations are easily saturated by bringing down 
appropriate powers of this term from the exponent. As expected from the discrete symmetry (8.30), 
the result is non-zero only when all the mj are non- vanishing. The remaining integrals are easily 
performed, yielding 

9 N F 



60 The absence of a one-instanton contribution is quite easy to see since {JCf, JCf} completely decouple from the instanton 
effective action (6.84) and consequently their integrals remain unsaturated. 
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Using (8.24), this gives T\ which is in agreement with (8.8a) for N F < 4, up to irrelevant VEV- 
independent constants for the cases N F = 2, 3. 



VIII.3.2 Two instantons 



Next we turn to the more calculationally intensive 2-instanton contribution. The parameters of the 
k = 2 ADHM super-instanton are contained in the following 3x2 matrices: 

/ W 1 W 2 \ ( Hi ^2 \ 

a = \-X + a 3 a x , M A =\ -Ai£ A + M lA M[ A . (8.35) 

\ ai -X-a 3 J \ M' A -Aii A -M lA ) 

Each element of a is a quaternion and of M. A a Weyl spinor. In addition, there are now AN F fundamental 
zero modes parameterized by the Grassmann numbers /Q/ and AC/j. We also define the following 
frequently occurring combinations of these collective coordinates: 

L = |wi| 2 + \w 2 \ 2 , H =\w 1 \ 2 + \w 2 \ 2 + 4|ai| 2 + 4|a 3 | 2 , 

= W 1 W 2 — W 2 Wi , 00 = \4Ptl 2 (w 2 T Z Wi — WiT 3 W 2 ) , 

(8-36) 

Y = -tfpu - 2M' 3 A M' 1A , Z = J2 {iCifKf* - KtfKfi) ■ 

/=i 

For k = 2 L is just multiplication by the quantity H. In terms of these variables the instanton effective 
action (6.84) (with the mass term (6.87) and re-scalings (8.22)) is written 

S = 8tt 2 L|0°| 2 - 27rV t (/^ 1A - i4ix 2A ) 

o _2 /o o>j\ 

- — (Q - \Z) (u - jy) + vr 2 m, {K lf K n + K v iC f2 ) . 

/=i 

We can now proceed to evaluate the centred instanton partition function. To start with, we can 
explicitly solve the bosonic and fermionic ADHM constraints. This is conveniently done by eliminating 
the off-diagonal elements a x and M' A , as follows: 

ai = 7T~T2 "3(^2^1 - wiw 2 ) , M' A = — ^ a 3 (2a 1 M'^ + w 2 fif - wifi 2 ) , . (8.38) 

4 1°3 1 ^\ a 2,\ 

We can now explicitly integrate out the (^-functions in the expression for the centred k = 2 instanton 
volume form (8.28): 



/ = ^2 / ^ ^ ^ | j-J d 2 M ,A £fl A rf2/i A| |l a 3| l«H | (8 ^ g) 



The bosonic parts of the 2-instanton collective coordinate integral was originally derived in Refs. [8, 
104, 105] by directly changing variables in the path integral. 
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We now have a series of integrals to perform. First of all, performing the Grassmann integrals over 
the parameters of the adjoint zero modes gives [48] 



/ 



{ n d 2 M' 3 A exp (0°t(^ 1A - ^ 2A ) + I (Q - \Z)) } 

A=l 



+ lH-^-\Zf(\(^f(Li-\tt\i)+J)} . 

This is the generalization to Np > of the Yukawa determinant given in Eq. (8.13) of [25]. The next 
step is to integrate over the Grassmann collective coordinates {/C, IC} using the identity 



, N F 

/ d 2NF IC d 2NF iC G(Z) exp ( - n 2 m f {KifKfi + £2/^/2) ) 

/ 1 



5.41) 



z=o 



where 

M (^) def ^ m 2 im | . . . m 2 (M (- F ) = x ^ M (^) = q l < Q) (g>42) 

/l</2<-</i=l 

Finally we turn to the remaining integration over the bosonic moduli. Following [25], it is convenient 
to change variables in the bosonic measure from {0,3, wi, W2} to the new set {H, L, Q} defined in (8.36). 
The relevant Jacobians are 



f°° A 03 - 01 TV 2 f 00 

/ d% " 31 ' 11 1 — — / dH, (8.43a) 

J-oo \ a 3\ 1 J L+2\Q,\ 

L 



00 3 r°° 

4 ./i " 



d* Wl d 4 w 2 — > — dL d 6 Q . (8.43b) 

-00 ° JO J\Q\<L 

The numerator and denominator in the left-hand side of (8.43a) are supplied by (8.39) and (8.40), 
respectively. In addition, we introduce re-scaled variables = Lil', H = LH', and uo = Luo' . The 
integral over L is now trivial. Finally we switch to spherical polar coordinates, 

J d 3 Q' — ► 2tt J d(cos9) J \Q'\ 2 d\n'\ , (8.44) 

where the polar angle is defined by = ||J1'||0°| cos#. This leaves an ordinary 3-dimensional integral 
over the remaining variables H, cos# and which is the precise analog of Eq. (8.19) in [25]. Performing 
this integral with the help of a standard symbolic manipulation routine gives results which can be 
summed up by the formula 



5 <T - 77^<?^T^<?i 2 - 4« + , (8-45) 



Jk N=2,N F ~ (00)6^^ ~ ip) 1 1V± N F -1 16(^0)2 lvl N F -2 ~ 7^^"N F -3 t ~1^^N f -A , 
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where the coefficients M ( t NF) are defined in (8.42). For N F < 4 these expressions are identical to the 
predictions from the Seiberg-Witten curve (8.8b), up to a physically unimportant additive constant in 
the case N F = 3. 

The situation in the N F = 4 theory is rather more subtle. To our knowledge the two-instanton 
prediction from Seiberg-Witten theory with arbitrary hypermultiplet masses has not been determined 
in the literature. However, in the massless case ones expects that the prepotential is classically exact: 
in other words given by the first term in (8.7). The result (8.45) for N F = 4 with vanishes masses 



T 2 



Q\2 



_ 7(0°) 

7V F= 4,m /= 2 8 3 5 



(8.46) 



is obviously in contradiction to this. The resolution of this discrepancy is explained in Refs. [48,91]. 
The point is that the Seiberg-Witten curve is parameterized by an effective coupling r c g rather than 
by the microscopic coupling r. The two definitions differ by an infinite series of even-charge instanton 
corrections: 

r cS = r+ c k e 2mkT . (8.47) 

fc=2,4,... 

The two-instanton computation above shows that 



VIII. 4 One instanton prepotential in SU(N) 

In this section we perform, following [20], the explicit evaluation of the centred instanton partition 
function for k — 1 in an theory with arbitrary gauge group SU(iV). It is best to start from the 
linearized formulation described in §VI.5. The instanton effective action for k — 1 is, from (6.96) (with 
factors of g removed by the re-scalings (8.22)): 

Np 

S = 4tt 2 { \w u6lX + 0Va| 2 + f/^Wx* + €Vua) + \Y J K f £f(x-mf)}+ S L .n, , (8.49) 

/=i 

where the Lagrange multiplier terms for the ADHM constraints are 

5 L . m . = -4i7r 2 {^(//> ud + w^nt) + D ■ ^pwiw u ^ . (8.50) 

Note that the quantity x for /c = 1 is just a complex variable rather than being a matrix. The linear 
shifts 

A A ^^ua jaA -A -A , ^Wua ja,A / o n\ 

K -»• K . K K + —r^ ■ ( 8 - 51 ) 

or or 

eliminate the linear terms of these variables in the action. The Grassmann integrals over {/i^, /x„ } then 
bring down the factors 

N 2 

n(2vr 2 <) , (8.52) 

u=l 
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where we have defined 

a u = X + <P° u , a* = x* + <P° u *- (8.53) 
The Grassmann integrals over the matter field coordinates {/C/,/C/} are simply evaluated: 

/N F N 
^/C^exp(-7T 2 ^/C / ^ / ( X -m / )) =n 2N -H(m f - X ) . (8.54) 

/=i /=i 

The integrals over {w U a,w^} are Gaussian and are accomplished using the identity 

f N 1 
/ d 2N w d 2N w exp ( - Arr 2 A u wtw UCi + Am 2 B u ■ ^^wj) = {2-k)' 2N J] — ^ . (8.55) 

J 7 u=l A u + B u 

All that remains are integrals over the auxiliary variables {x, D, ^a}'- 

2 N *2 N F 



j^(JV=2,JV F ) 



= w n*** n KI<+ ^ + , )a iiK - x) . im 



where E u is the Grassmann bilinear 



S«=K) #f*V2- (8-57) 
The integrals over ^ must be saturated with two insertions of S: 

5 cd 



/n^ S ^ = -8^, (8.58) 



leading to the identity 



/n W) - ^gL ^ -ff ^ • <"» 



where, in our case, 



N 



F(E) = TT ^ . (8.60) 



We are now in a position to integrate out the Lagrange multipliers D. The non-tr ivial p art of the 
integral is easily performed by a standard contour integration in the variable \D\ ■ D extended 

to run from — oo to +oo: 



N *2 N * N *2 

J-l \„. 14 i n2 ^ n Al U 4 _ U 4 V ' 



£ 2 ^ K| 4 + £ 2 ^ «« ti Kl ~ !«• 

Finally, it only remains to integrate over x' 

Z ( t 2 ' NF) = -7T- 1 J d\ J^MX, X*)f2(x) , (8.62) 
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where we have defined the two quantities 



The resulting integral over the x _ pl ane can be evaluated by Stoke's Theorem. There are two kinds of 
boundary to consider: on the sphere at infinity and around the singularities of the integrand. First the 
singularities. Contrary to appearances the integrand is completely regular at \a u \ 2 — \a v \ 2 due to the 
cancellation between the u th and v th terms in the sum. However, there are N singularities at a u = 0, 
i.e. x — ~ 0m! following from the fact that for z = x + iy 

~ = nS(x)5(y) . (8.64) 

oz* z 

Introducing polar coordinates in the vicinity of the point x = ~ ( Pu> a u — re 10 , we take a boundary in 
the form of a small circle of radius r — > 0. The resulting contribution is then 



limi-(2 + r|-) J~" Me™h{r,6)f 2 {re*) . (8.65) 

Applying this formula in the vicinity of x — we find a contribution to the centred instanton 
partition function of 

N N F 

n M r W n i (-/+*!)- <*«» 

We can determine the contribution from the sphere at infinity by once again introducing polar 
coordinates. Then in a similar fashion to (8.65) the contribution is 

-^4^( 2 + r ^)y o d9e™ fl (r,e)f 2 (re*). (8.67) 



It is easy to establish the following asymptotic forms for large r 

-2iNe 

/ifr °) ~ > /*(™*) ~ e w 'V*' • (8.68) 

This means that the contribution from the sphere at infinity is only non- vanishing when Np > 2(JV — 1), 
ie. in the three cases iVV = 2N — 2, 2iV — 1, 2N. Taking into account the selection rule arising from the 
integration over 0, there are only two relevant terms in the asymptotic expansion of /i(r, 0), namely 

-2iN0 -2i(N+l)0 N 
/l(r , 0) ~ «1 2 (JV-1) + "2 2N~ 

«=1 

where we have defined the two constants 

^W^jf), (8,0) 
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All other terms are either sub- leading in 1/r or have the wrong 9 dependence (taking account of the 
fact that f'2 is a polynomial in e td along with the factor of e 2td in (8.67)). Hence, the contribution to 
the centred instanton partition function from the sphere at infinity is 



c( n f) 

*>k=l 



ro 



- < 



EN 
f=l m f 

«i E5=i m .f m f + "2 T,Z=M) 

(/</') 



0\2 



Np < 2N — 2 , 
Nf = 2N — 2 , 
Np = 2N — 1 , 
Np = 2N . 



(8.71) 



All-in-all, the k — 1 contribution to the centred partition function, and hence the coefficient T\ of the 
prepotential, is the sum of (8.66) and (8.71): 



N N 

=eii 



(JV F ) 



(8.72) 



/=i 



This calculation illustrates the complexity of integrating over the instanton moduli space even at 
the one instanton level. However, if one follows the method in detail an interesting intuitive picture 
emerges. First recall that the final integral over x was evaluated by using Stoke's Theorem: there are 
only contributions from an isolated set of iV points along with a contribution from the sphere at infinity. 
We now show that these contributions arise at the critical points of the instanton effective action (8.49). 
The latter correspond to the vanishing 

w„X + <P°w« = • (8.73) 

So there is a branch consisting of iV solutions, labelled by u, where 

X = ~<t>u > w va oc 5 UV , (8.74) 

i.e. precisely at the positions of the isolated contributions to the instanton effective action. However, it 
is apparent that these are critical points of S only on a branch where the instanton has non-zero scale 
size: p 2 = ^w a Wa > 0. There is another branch of solutions when p — 0, i.e. «)„ = where physically 
the instanton has shrunk to zero size. This second branch is associated with the contribution from 
the integral over the sphere at infinity in x-space. To see this notice that when x is eliminated via its 
"equation-of- motion" one has x ~ P 1 '- so p ^ does indeed correspond to the circle at infinity in 
X-space. These observations are intended as anecdotal evidence in favour of some kind of localization 
on the moduli space of instantons. This is subject that we will pursue in earnest in §XI.2. 
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IX Conformal Gauge Theories at Large N 



Instanton calculations are only reliable at weak coupling which can be achieved in a Higgs or Coulomb 
phase, giving rise to the applications reported in Chapters X.l and VII. However, theories can be 
weakly-coupled without invoking the Higgs mechanism if they are finite, or conformal. In this case the 
gauge coupling does not run with scale and weak coupling prevails for small g. The two main examples 
that we discuss here are the M = 4 theory and the Af = 2 theory with N F = 2N hypermultiplets 
in the fundamental representation of the gauge group. We shall see that these two examples have 
some interesting features in common and, in particular, the calculation of instanton effects in both 
cases considerably simplify in the large N limit (see Refs. [21, 106, 107]). The most striking feature 
of this limit is that the instanton measure concentrates on a subspace of the moduli space, which, 
in the M = 4 case, is essentially AdS^ x S 5 . Of course this phenomenon is directly related to the 
AdS/CFT correspondence, which we discuss in §IX.3. The approach of Refs. [21,106,107] that we will 
describe has been generalized to other finite gauge theories. Ref. [108] considered the M = 4 theory 
with gauge groups Sp(iV) and SO(iV); Refs. [109,110] considered finite M = 2 gauge theories with gauge 
group Sp(iV); and Ref. [Ill] considered finite M = 2 theories with product gauge group SU(iV) fc and 
hypermultiplets in bi-fundamental representations of the gauge group — the "quiver" models. 

The main common feature of finite gauge theories is the fact, discussed in §IV.3, that the super- 
symmetric instanton with the full set of fermion zero modes turned on is not an exact solution of the 
equations-of-motion. Put another way, most of the fermion zero modes that appear at linear order 
around the instanton background are lifted by Yukawa interactions as evidenced by the leading order 
expression for the instanton effective action (5.25) or (6.84) (with 0° = in the present context). In 
both cases, this effective action involves a quadrilinear coupling of the Grassmann collective coordinates. 
Of course, in both cases, the supersymmetric and superconformal zero modes are protected by symme- 
tries and remain unlifted by the instanton action. The pattern of lifting dictated by the Grassmann 
quadrilinear implies that instantons of all charges k will contribute to a class of correlation functions 
whose insertions evaluated in the instanton background are responsible for saturating the integrals over 
a finite number of unlifted modes. In both cases, the number of unlifted fermion modes is sixteen. 
This is directly the number of supersymmetric and superconformal modes in the M = 4 theory while in 
the M = 2 theory it includes the supersymmetric and superconformal modes — numbering eight — along 
with eight additional fundamental fermion zero modes. 

The aim of this section is show how instanton contributions to these kinds of correlation functions 
can be calculated in the large-iV limit. They will turn out to have remarkable properties. The only 
dependence on the instanton charge appears in an overall numerical pre-factor: 

oo 

(O&W) x ... x O n (,("))} inst (J^k n c k e^)f{x^\...^) , (9.1) 

k=i 

for some function / and coefficients independent of the correlator in question. At first sight, the 
limit (9.1) looks absurd. How could it be that the only dependence on the instanton number k is 
via an overall multiplicative factor? The reason for the simple dependence on k rests on certain very 
special properties of the large- N limit of the instanton calculus. Intuitively what happens in the large- 
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N limit is that integral over the /c-instanton moduli space is dominated by configurations of k single 
instantons occupying k commuting SU(2) subgroups of the gauge group which are therefore totally 
non-interacting. In this sense the dominant configurations are dilute- gas-like. But, contrary to the 
dilute gas, the dominant configuration also has all the instantons lying at the same spacetime point 
X n and having the same scale size p. This accounts for the fact that the functional dependence on the 
insertion points is the same for all instanton number. We will show that the /c-dependent numerical 
factors Ck are related to interesting matrix integrals; namely, the partition functions of dimensionally 
reduced gauge theories. 

In addition, when taking the large- N limit, it is convenient to introduce auxiliary bosonic collective 
coordinates to bi-linearize the Grassmann quadrilinear in the instanton effective action. In the large- 
N limit these additional variables are confined to a sphere: S 5 for M = 4 theory [21] and S* 1 for 
M = 2 [107]. The appearance of these auxiliary collective coordinate is especially interesting in the 
light of the AdS/CFT correspondence [112, 113] relating the M = 4 theory to Type IIB string theory 
on AdS$ x S* 5 as discussed in §IX.3. 

IX. 1 The collective coordinate integrals at large N 

We begin our analysis by establishing an expression for the instanton partition function in the large- N 
limit. There are two relevant cases: M = 4 and M = 2 with Np = 2N hypermultiplets. The expressions 
for the collective coordinate integral (5.14) is obtained from (5.55), with S equal to (5.25), in the M = 4 
theory and (6.84), for the Af = 2 case. 61 Since we are interested in pursuing a large- N limit, we can use 
the results described in §VI.l to resolve the ADHM constraints and therefore use the explicit version 
for the supersymmetric volume form in (6.29). Note that the requirement N > 2k will certainly be met 
in the large- N limit (for fixed k). Since we are working in a non-abelian Coulomb phase all the VEV 
vanish and the gauge symmetry remains unbroken. In this case, we can immediately integrate over the 
gauge orientation of the instanton: J d 4k ( N ~ k ^U = 1. 

The key to taking a large- N limit is to bi-linearize the Grassmann quadrilinear effective action (5.26) 
or (6.84), by introducing some auxiliary variables. This kind of transformation is a well-known tool for 
analyzing the large-iV limit of field theories with four-fermion interactions, like the Gross-Neveu and 
Thirring models [114]. In fact we have already introduced the necessary auxiliary variables in the form 
of the k x k matrices Xa in §VI.5. In particularly the linearized instanton effective actions (6.91) and 
(6.96) involve only Grassmann bilinears. 

61 The VEVs are to set to zero in both cases to preserve conformal invariance in the present applications. 
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In the J\f = 2 theory the identity we use is 

2 N F 



det k 2L\ *exp |^~y^tr fc KfKLfL 1 M A M A } 

/=i 

= 2 2k \ k2 J d 2k \exp{-An 2 (tT k XaLXa + ^T k M A M A x j + ^tT k IC f IC f x)} , 



whilst in the Af = 4 case 

2 

Idet^Xl-'expj^eAB^tr^^^^L- 1 ^^^) 

- L 2 J (9.3) 

= 2 6k \ 3k2 J d 6k2 X exp{-4n 2 (tr kX aLxa + ^ aAB tr k M A M B x a )} . 

Notice in both cases that the appropriate factor of (detail is already present in the supersymmetric 
volume form on DJl k (5.55) (or (6.29)). 



IX.1.1 The M = 4 case 



Before we proceed, and with the large- N limit in mind, it is useful to re-scale 



Xa -> -Tf^Xa • (9.4) 

It is also convenient to define 

XAB = ^^aABXa , Xa = — ^^XAB (9.5) 

which satisfies a pseudo-reality condition following from the Hermiticity of Xa- 

X\ B = \e ABCD XGD , (9.6) 
where f only acts on the instanton indices rather than the SU(4) i?-symmetry indices. 

After integrating over the gauge orientation and using the identity (9.3), the instanton partition 
function is 



o3fe 2 Ar3fc 2 ri(.V=4) a r 4 



A=l 

x Idetofr W I N ~ 2k e~ NtIk xaL Xa +Vm-ntr k m a m b X ab _ 



(9.7) 



In this expression, and in the following, W is the 2k x 2k matrix defined in (6.1), where the ADHM 
constraints (6.3) are imposed; hence, 



W = \W\ 2]x[2] -2a' m a' n( j mn . (9.8) 



135 



The next stage in the large- N program is to to integrate out the Grassmann variables {u A ,u A } by 
pulling down powers of the second term in the exponential in (9.3): 

I ( f[d k ^ N - 2k ^ A d^ N - 2k ^ A \ e W V8Nrrtr k u A u B XAB = (8N7r 2 ) 2kiN - 2k) (det^xl^ , (9.9) 

•' ^ A=l ' 

where the determinant is over the 4k x 4k dimensional matrix with elements (xab)^- One might think 
that it is rather premature to integrate out the Grassmann variables {z/ A , u A } since they could be 
saturated by insertions made into the functional integral. This is possible, but we shall find later a very 
simple prescription for including this kind of dependence when working at large N. 

After integrating out the subset of Grassmann variables as above, we can collect together all the 
non-constant terms in the collective coordinate integral which are raised to the power N as exp —NT. 
The large- N "effective action" is 62 

T = - log det 2fc W - log det 4 fcX + tr fc XaLxa ■ (9. 10) 

This expression involves the Ilk 2 bosonic variables comprising the eleven independent k x k Hermitian 
matrices W°, a' n and Xa- The remaining components W c , c = 1,2,3, are eliminated in favor of the a' n 
via the ADHM constraints (6.3). The action is invariant under the U(k) symmetry (2.64) which acts 
by adjoint action on all the variables. 

With N factored out of the exponent, the measure is in a form which is amenable to a saddle-point 
treatment as N — > oo. The critical points of T satisfy the equations 

e ABCD (x . Xab) = 1 5 D 1[fc]xw ^ (Q Ua) 

XaXa = \{W- l f , (9.11b) 

[ Xa ,[Xa,a' n ]] = tT 2 {T c a nm ){{W- l y,a' m ] . (9.11c) 

These are obtained by varying Y with respect to the matrix elements of x, W° and a' n , respectively, and 
rewriting "logdet" as "trlog." We have defined the k x k matrices 

(W- 1 ) = tr 2 W- 1 , (W- 1 )" = tr 2 (t'W- 1 ) . (9.12) 



The general solution to these coupled saddle-point equations is easily found. It has the simple 
property that all the quantities are diagonal in instanton indices: 

W° = diag(2p 2 ,...,2p 2 ) , (9.13a) 

Xa = diag(pr 1 ^,...,Pfc 1 ^) , (9.13b) 
a' n = diag(-Xi,...,-X n fc ) , (9.13c) 

up to adjoint action by the U(k) auxiliary symmetry. For quantities Cl l a parameterize k unit six-vectors, 
or points in S 5 , 

ttitti = 1 (no sum on i) , (9.14) 

62 In the following we translate back and forth as convenient between the antisymmetric tensor representation xab and 
the SO(6)i? vector representation \ a , a = 1, ... ,6 (9.5). 
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where the radius of the i th S 5 factor is p i 1 . 

A simple picture of this leading-order saddle-point solution emerges: it can be thought of as k 
independent copies of a large-iV one-instanton saddle-point solution, where the i th large- TV instanton 
is parameterized by the triple {X^, Pi,& l a }- Additional insight into this solution emerges from consid- 
ering the SU(2) generators (T?) uv describing the embedding of the i th instanton inside SU(iV). From 
Eqs. (2.103), (6.2) and (9.13a) one derives the commutation relations 

[Tf,^] = 2i5 l0 e abc T? (9.15) 

so that at the saddle-point, thanks to the Kronecker-5, the k individual instantons lie in k mutually 
commuting SU(2) subgroups. Actually this feature follows intuitively from large- N statistics alone, 63 
and has nothing to do with either the existence of supersymmetry or with the details of the ADHM 
construction. Another important property is that the effective action (9.10) evaluated on these saddle- 
point solutions is zero; hence there is no exponential dependence on N in the final result. Finally 
we should make the technical point that, thanks to the diagonal structure of these solutions, they 
are automatically consistent with the triangle inequalities on the bosonic bi-linear W discussed in the 
paragraph following Eq. (2.84); hence we never need to specify more explicitly the integration limits on 
the W variables. 

The saddle-point solution depends on the moduli of each of the k large- N instantons. However, not 
all these 10k parameters are the genuine moduli, or flat directions of T, because, as we shall see, they 
are "lifted" by terms beyond quadratic order in the expansion of T. In fact the only genuine moduli 
are the 10 overall large- N coordinates. The correct large- N behaviour is obtained [21] by expanding T 
around the maximally degenerate saddle-point solution 

= 2p 2 l[ fc ] x[fc ], Xa — P "a l[fc]x[fc], a n = ~X n l[*]x[k] i (9.16) 

which corresponds to the k large- N instantons living at a common point {X n , Cl a , p). (From the ADHM 
constraint (6.3) it follows that the remaining components of W vanish: W c = for c = 1,2,3.) 
This degenerate solution, unlike Eqs. (9.13a)-(9.13c), is invariant under the residual U(fc). With the 
instantons sitting on top of one another, it looks like the complete opposite of the dilute instanton 
gas limit; however the instantons still live in k mutually commuting SU(2) subgroups of SU(A^) as per 
Eq. (9.15), which is a dilute-gas-like feature. 

In order to expand about this special solution, one first needs to factor out the integrals over the 
parameters {X n , p,Q a }. This is done in the following way: for each k x k matrix, we introduce a basis 
of traceless Hermitian matrices T r , r — 1, . . . , k 2 — 1, normalized by tr^ T r T s = 5 rs . For each k x k 
matrix v we separate out the "scalar" component v by taking 

v = v l [k]x[k] +v r f r . (9.17) 

The change of variables from the {T r } basis used in (9.7) to {l [fc]x[fe] ,T r } involves a Jacobian 

J d k \ = k ±1/2 J dv d k2 ~ l v, (9.18) 

63 Consider the analogous problem of k randomly oriented vectors in ~R N in the limit N — > oo; clearly the dot products 
of these vectors tend to zero simply due to statistics. 
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where ±1 refers to c- number and Grassmann quantities, respectively. For the moment we continue to 
focus on the bosonic variables, which are decomposed as follows: 

a 'n = ~ X n l[k]x[k] + o! n i (9.19a) 

Xa = p _1 fial[fe]x[fe] + Xa • (9.19b) 

By definition the traceless matrix variables a' n and Xa are the fluctuating fields. 

Inserting Eqs. (9.19a)-(9.19b) into Eq. (9.10) and Taylor expanding is a tedious though straightfor- 
ward exercise. It is necessary to expand to fourth order in the fluctuating fields around the solution 
parameterized by the ten exact moduli. The expansion of the determinant terms in (9.10) is facilitated 
by first writing "log det" as "tr log" and then expanding the logarithm: 



tv 2k W = 2A;logp 2 + \tv k (5W°) - -l-tT k (5W ) 2 
p z 4p 4 



(9.20) 



and 



tr 4fc log X = -2Hog(8p 2 ) - 2 5 p 2 tr 4fe (fi*x) 2 + ^ftr 4k (n* X ) 3 - 2 w pHr 4k (frx)* + • • • . (9.21) 

In these expansions we have dropped fifth- and higher-order terms in the fluctuating fields. Here we are 
anticipating the fact that these terms are not needed at leading order in 1/N. To see this we can re-scale 
the integration variables in a standard way which shows that the fluctuations around the maximally 
degenerate saddle point are of order N~ l / A . The higher order terms in the exponent therefore yield 
sub leading contributions in the 1/N expansion. In particular, this is true for the diagonal components 
of b! n and Xa which correspond to the moduli of the generic saddle-point solution discussed earlier in 
this Section. This shows that our large- N expansion around the maximally degenerate saddle-point 
is self-consistent. In Eq. (9.21), and in subsequent equations, we move back and forth as convenient 
between the 6-vector and the antisymmetric tensor representations of Cl and x using Eq. (9.5). In 
particular, the SO (6) orthonormality condition Ct • — 1 becomes, in 4 x 4 matrix language, 

nn* = -|i [4]X [4] , or ir 1 = sn* , (9.22) 

which has been implemented in Eq. (9.21). 

Next we need a systematic method for re-expressing the traces over 4k x 4k matrices in Eq. (9.21) 
as traces over k x k matrices. We will exploit the following "moves" : 64 

Cl*x = -x ] (l-\^-x)l { , Ym , (9.23a) 
tr^E^F — tr^F^E — —\{E ■ F) , (9.23b) 
tr 4 E^FG^H = i (E a F a G b H b - E a F b G a H b + E a F b G b H a ) . (9.23c) 



64 In the following, we should emphasize that t only acts on instanton indices, as per the reality condition (9.6), and 
not on SU(4) matrix indices. 
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On the left-hand sides of Eq. (9.23b)-(9.23c), the 4k x 4k matrices {E, F,G, H} are assumed to be 
antisymmetric in SU(4)^ indices and subject to the usual conditions (9.6)-(9.5); the identity (9.23a) 
follows from a double application of Eq. (9.6). Using Eqs. (9.22)-(9.23c) in an iterative fashion, it is 
then easy to derive the following trace identities: 

tr Ak (n*x) 2 = -tr Ak x^n*x-\tr k (n-xtr 4 (n*x)) 

= ^tT k (n-x) 2 -^tr k x-X, (9.24a) 

tu k (n*x) 3 = lt Uk (x^Wx) + ^tT k (n-x) 2 x-x-^^-x? 

= -^tr k (n-x) 3 + ^tr kX -xti-X, (9.24b) 

tr 4 fe (^*x) 4 = fo4k(iiX*xx*x - hx'xi^ ■ xWx 

- • m*xx ] x + u& ■ x)&m ■ m*x) 

1 - 4 1 - 2 1 2 1 

= ^j tr k • x) - ^tr fc (Q ■ x) X-X+ ^tr fe (x ■ x) - ^tr fe XaXbXaXb • (9.24c) 

As before, on the left-hand side of these formulae, 4k x 4k matrix multiplication is implied, whereas 
on the right-hand side, all SO (6) indices are saturated in standard 6- vector inner products, leaving the 
traces over k x k matrices. 

From Eqs. (9.10), (9.20), (9.21), and (9.24a)-(9.24c), one obtains for the bosonic part of the expansion 
of the action V: 

r b = r( 2 ) + r( 3 ) + r< 4 > + • • • (9.25) 

where the quadratic, cubic and quartic actions are now given entirely as /c-dimensional (rather than 2k 
or 4/c-dimensional) traces: 

r( 2 )=tr fc ^ 2 , <p = 2pto-x+^SW°, (9.26a) 

T (3) = - tr k (5W ) 3 + 4p 3 tr k Cl ■ x X • X ~ ^tr fc (Cl ■ X ) 3 + tr fc SW° X • X 

= 2p 2 tr kV (x-x-4(n-x) 2 )+--- , (9.26b) 
T {4) = -^tr fc [a' n , a' m ] 2 + J-^tW ) 4 - tr fc [ Xa , a' n ] [ Xa , a' n ] + 8p 4 tr fe (Cl ■ xf 



- 8p 4 tr fc (Cl-x) X-X + 2p 4 tek (X'X) ~ A r fc XaXbXaXb 
tr, K a»] 2 - 8p 4 tr, (A ' X) 2 X ■ X + 2pHr k ( X ■ xf 



2p4 

+ 16p 4 tr fe (Cl-x) - P 4 tffc XaXbXaXb ~ tr fe [x a , ajj [Xa, a' n ] + ■ ■ ■ . (9.26c) 

Notice that only k 2 fluctuations, denoted ip, are actually lifted at quadratic order. This, in turn, implies 
that certain terms in r( 3 ) and r( 4 ) are subleading, and can be omitted. Specifically, the omitted terms 
in the final rewrites in Eqs. (9.26b)-(9.26c) contain, respectively, two or more, and one or more, factors 
of the quadratically lifted <p modes, and consequently are suppressed in 1/N (as a simple re-scaling 
argument again confirms). 

Now let us perform the elementary Gaussian integration over the <p's. Changing integration variables 
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in Eq. (9.7) from d k ' 2 W° to d k \ using Eq. (9.26a), one finds: 

d k* w o e -N { T(v + m = ^_y 2 /\- N ^> (927) 

where the new induced quartic coupling reads 

rW = -p 4 tr fe (x-X-4(fi-x) 2 ) 2 . (9.28) 

Combining I^ 4 )' with the original quartic coupling (9.26c) gives for the effective bosonic small-fluctuations 
action: 

T b = ~ti k {p- A [a' n , a 1 J + 2\x a , a' n f + p 4 [x„, Xb?) • (9.29) 
Remarkably, all dependence on the unit vector fl a has canceled out. 

Notice that apart from the absence of derivative terms, the expression (9.29) looks like a Yang-Mills 
field strength for the gauge group SU(/c)! We can make this explicit by introducing a ten-dimensional 
vector field Am, 

A M = N 1 / 4 (p- 1 a' n ,pXa) , M = l,...,10, (9.30) 

in terms of which 

NT b {A M ) = --trjfc [A M ,A N ] 2 . (9.31) 

We recognize this as precisely the action of ten-dimensional SU(fc) gauge theory, reduced to dimensions, 
i.e. with all derivatives set to zero. 

Now let us turn to the Grassmann collective coordinates. Since M = 4 supersymmetry in four 
dimensions descends from M = 1 supersymmetry in ten dimensions, and since all our saddle-point 
manipulations commute with supersymmetry, we expect to find the M = 1 supersymmetric completion 
of the ten-dimensional dimensionally-reduced action (9.31), namely 

NFi(A M , *) = -itr fc ^r M [A M , *] , (9.32) 

where \P is a ten-dimensional Majorana-Weyl spinor, and Tm is an element of the ten-dimensional 
Clifford algebra. To see how this comes about, we first separate out from the fermionic collective 
coordinates the exact zero modes, in analogy to Eqs. (9.19a)-(9.19b): 

M'* = -4i£l wxw - Ua' a6i t A + M'* , (9.33a) 
C^ = -4nf% x[fc] + C A . (9.33b) 

Here, £^ and fj aA are the supersymmetric and superconformal fermion modes (4.43)- (4.46). Expanding 
the remaining part of the fermion coupling in the exponent of (9.3) around the special solution (9.16) 
and using the relations (6.3) and (9.26a), we find 



AT f = (87r 2 iV) 1/2 tr fc ( V -2p({l-x))pn AB C A ^ + p^n AB [d' aA ,M' aA ]C 

n (9-34) 

+ XAB( P 2 C A (* + M laA M'a B ) 



laA] ZdtB 
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If we now define the ten-dimensional Majorana-Weyl fermion field ^ 

* = sj\^ [ P - 1/2 (?) ® (J) A# + P V2 (?) ® (?) Cl , (9.35) 

and the Tm matrices according to Eq. (9.36) below, we do in fact recover the simple form (9.32). 
In moving from Eq. (9.34) to Eq. (9.32) we have dropped the term depending on ip; since if is a 
quadratically lifted bosonic mode its contribution is suppressed in large N compared to the other 
couplings in Eq. (9.34), as a simple re-scaling argument confirms. The representation of the SO(10) 
T-matrices is constructed via the decomposition SO(10) D SO(6) x SO(4) in the following way. Firstly, 
We define an SO (6) rotation matrix R, RR T = 1, such that Vt' a = R a b&b lies entirely along, say, the 
first direction, i.e. tl' a oc 8 a \. In the new basis, we have a representation of the SO(6) Clifford algebra 
r' a = Rab^b- In the rotated basis, we can construct a representation of the SO(10) Clifford algebra as 
follows: 

T'm = {r'l ® 7n , T' a ® (<5 al7 5 + (1 - 5al)l[4]x[4]) } , (9.36) 

where n — 1, . . . , 4 and a = 1, . . . , 6. The representation of the SO(10) Clifford algebra that we need is 
then found by un-doing the rotation on the six-dimensional subspace: 

T M = {r; <g> 7 n , (R-^obK ® (5w75 + (1 - S bl )l [4]x[4] ) } . (9.37) 

One can then verify that (9.35) is Majorana-Weyl with respect to this basis. Note that unlike the 
bosonic sector, the Cl a dependence of the fermionic action does not actually disappear; it is simply 
subsumed into the representation of the SO(10) T-matrices. 

Finally our effective large- N /c-instanton partition function has the form 
V ^J^wva^, / P-^P ^ IT • ZJSSS , (^38) 

where £g d u(fcj * s the partition function of an M = 1 supersymmetric SU(fc) gauge theory in ten dimensions 
dimensionally reduced to zero dimensions: 

£suS= / d w Ad w ^e- s ^ , 

Jsv(k) (9>39) 

S(A M ,V) = iV(r b + r f ) = ~tr k [A M ,A N ] 2 -itr k *r M [A M ,V] . 

Notice that the rest of the measure, up to numerical factors, is independent of the instanton number k. 
When integrating expressions which are independent of the SU(fc) degrees-of-freedom, Z^n^l is simply 
an overall constant factor. A calculation of Ref. [115] revealed that £g d u(fcj * s proportional to ^2 d \ k d~ 2 , 
a sum over the positive integer divisors d of k. However, the constant of proportionality was fixed 
definitively in Ref. [116] to give 65 

9 17fe 2 /2+fc/2-9„5fc 2 +fc/2-7/27,-l/2 i 

= ff^ — E h ■ (9 ' 40) 

lli=l d\k 



65 



In comparing to Ref. [116], it is important to note that our convention for the normalization of the generators is 



ti k f r f s = S rs , rather than \8 rs in Ref. [116]. 
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In summary, the effective large- N, semi-classically leading-order, collective coordinate measure has 
the following simple form [21]: 

k -7/2 e 2Kikrf J- 1} f d 5 n J] W . (9.41) 

d\k d J P A=l 



e 27rifcr^(AT=4) i V->oo VN 



"fc 2 33 7T 27 /2 



IX.1.2 The M = 2 case 



We now follow essentially the same steps to deduce a form for the large- N collective coordinates integral 
in the finite M = 2 theory following Ref. [107]. In view of the large- N limit to come, we first re-scale 



X 



N 



2tt 



■X 



Performing the integration over the v A 's and u A, s gives 

JJ d k(N-2k) u A d k(N-2k)-A exp L ^^VJ = (iV7T 2 )* (JV - 2fc) 



det fc x 



t |2(AT-2fc) 



A=l,2 



Similarly, integrating out the matter Grassmann collective coordinates 

■ N7r 2 kN 



d 2kN )Cd 2kN IC exp 




Np=2N 

n Y tekfcffcfX = (-|-) | det fcX| 
/=1 



2N 



(9.42) 



(9.43) 



(9.44) 



Once again, we collect all the terms to the power N to give the large- N "effective action" 

T = - log det 2fe W - 2 log det^ + tr* XaL Xa ■ (9.45) 



We now turn to the solution of the large- iV saddle-point equations. These are the coupled Euler- 
Lagrange equations that come from extremizing T with respect to {a' n , W°,x a }- The analysis of these 
coupled equations is virtually identical to the Af = 4 case in §IX.1.1; hence we suppress the calculational 
details. In particular, the dominant configuration is the maximally degenerate solution (9.16) 

where now Cl a is a unit 2-vector. It is convenient to parameterize Cl a by the phase angle 0: 

n x + i(} 2 = e i4> . (9.46) 



The next stage of the analysis is to expand the effective action in the fluctuations out to sufficient 
order to ensure the convergence of the integrals over the fluctuations. Fortunately, the analysis need 
not be repeated because our effective action (9.45) is, up to a constant, simply the Af = 4 effective 
action (9.10) with the replacements X12 xl v^8, X34 ~^ xV and all other components of xab set to 
zero. The fluctuations in W° are integrated out at Gaussian order to leave integrals over the remaining 
fluctuations in a! n and Xa, the traceless parts, that are lifted at quartic order; the terms beyond quartic 
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order are then formally suppressed by (fractional) powers of 1/N, and may be dropped. The "action" 
for these quartic fluctuations is precisely the action for six-dimensional Yang-Mills theory, as in (9.30) 
and (9.31) but where the indices M — 1, ... ,6. 

Not surprisingly the coupling to the Grassmann collective coordinates completes the six-dimensional 
theory to a supersymmetric gauge theory in six dimensions dimensionally reduced to dimensions. The 
six-dimensional theory has eight independent supersymmetries and is therefore an Af = (1, 0) theory in 
six dimensions. The eight component fermion field of the Af = (1,0) theory has components 

= ^N^e-^ip-^M'*, P 1/2 C A ) , (9.47) 



Following the same steps as in the Af = 4 theory, the effective large- N /c-instanton measure, for the 
leading order semi-classical approximation of the functional integral, has the form 

A7-1/2 p 2-KikT r 2 (9.48) 

k2 9ky2-k/2+12 7r5ifc 2 /2H-4VolU(A;) J P 9 9 ll fl S fl7 7 ^SU(fc) > 

where Zk is the partition function of the Af = (1, 0) supersymmetric SU(fc) gauge theory in six dimen- 
sions dimensionally reduced to zero dimensions: 

£suS)= / d % Ad^e- s(A ^ , 

^SU(fc) (a49) 

S(A M ,V) = iV(r b + r f ) = --tr fc [A M) A N f -iti k ^Y M [A M ,^} . 

When integrating expressions which are independent of the SU(fc) degrees-of-freedom, £gu(fc) is simply 
an overall constant factor that was evaluated in [116]. In our notation 66 

su(fc) y J K J 2v^nti »! k 



In summary, the effective large- collective coordinate measure has the following simple form [107]: 

e 2«ikT Z (M=*,N F =™) N^oo _^N_^ k -7,2 e 2,ikr f <^dp # ^ tf^Atf ^ ^ 

2 71 J P A=1 

This has a remarkable similarity to the form of the Af = 4 measure (9.41). Apart from the differences 
in the overall numerical factors, the integral over S 5 is replaced by S* 1 and the Af — 4 measure involves, 
in addition, the factor Yld\k ■, the sum over the integer divisors of k. Notice that the \/N dependence 
and factor of k~ 7 ^ 2 is the same in both cases. 

66 We have written the result in a way which allows an easy comparison with [116]. The factors of \/2w and \/2 arise, 
respectively, from the difference in the definition of the bosonic integrals and the normalization of the generators: we 
have tr fc T r T s = 5 rs rather than \5 rs . The remaining factors are the result of [116]. 
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The appearance of the e il ^ phase in (9.51) implies a selection rule in order that correlation functions 
are non- vanishing. This is a relic of fermion zero mode counting. From (9.2), we see each insertion of an 
adjoint fermion Grassmann collective coordinate (other than those associated to broken supersymmetry 
and superconformal invariance which drop out from the couplings in (9.2)) implies an insertion of e 1 ^! 2 
and each insertion of a fundamental fermion Grassmann collective coordinate implies an insertion of 
e -«</>/2 rjnj^g se } ec ^j on ru } e i m plies that the difference between the latter and former is 8. Therefore 
the simplest kind of non-vanishing correlation functions would involve insertions that saturate the 8 
integrals over £ A and fj A and, in addition, integrals over 8 of the fundamental fermion Grassmann 
collective coordinates {/C, £}. 



IX. 2 Large-iV correlation functions 

Having established the form for the collective coordinate integral in the large- N limit, we can now 
calculate correlation functions of various composite operators. We are primarily interested in those 
correlators that receive contributions from all instanton numbers. From the form of the integrals (9.38) 
and (9.48), we can now identify those these as the ones for which the leading order (in 1/N) expression 
for the insertion is independent of the SU(/c) variables. Hence the only non-trivial dependence is on 
{X n , p, Cl a } and f] A }. 67 In this case, we can use the reduced form of the integrals (9.41) and (9.51). 
For these special correlation functions the only dependence on the instanton charge will be through an 
overall multiplicative factor as advertised in (9.1). 

Before considering specific insertions, we can make some useful general statements. At leading order 
in 1/N, we can replace each operator insertion with its classical /c-instanton saddle-point value. Since 
the saddle-point solution (9.16) is relatively simple, this observation greatly streamlines the form of the 
operator insertions. We will restrict our attention to operators consisting of a single trace on the gauge 
group index of a product of adjoint scalars, fermions and field strengths. Each of these three adjoint 
quantities is of the type U XU where U is the ADHM quantity defined in (2.47) and X is some matrix 
of ADHM variables; consequently the insertions have the form 

0{x) = t rjv [UX 1 UUX 2 U ■ ■ ■ UX P U] = tr N+2k [VX 1 VX 2 V ■ ■ ■ VX P ] , (9.52) 

where V = UU is the projection operator (2.51). It is easily checked that at the saddle-point, the 
bosonic ADHM quantities /, a' n , L and V collapse to 



1 

{X - Xf + p 2 x WxW ' 

J 1 f ((X - Xf + p 2 )l [N]x[N] - WaW" -Wa{x ~ Xf 

L[2fe]x[2fe] 



/ ~* ~TZ. v\<2 i ^^[*]xW ' a n ~~ * — X n l [k]x[k] , L — > 2p 2 , 

(9.53) 



(x-X) 2 + p 2 V -(x-X) a6l w« p 2 \ { 



One can verify that deviations from these saddle-point values are suppressed by powers of N 1//4 . 

The analogous replacement prescription for the fermionic ADHM quantities is, in general, somewhat 
trickier. Recall the expansions (9.33a) for the Grassmann collective coordinates. In analogy with the 



67 



We will describe how dependence on fl a in the insertion arises in due course. 
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bosonic quantities (9.16)-(9.53), it is useful to think of the unlifted variables £ A and r/ A themselves as 
the arising from a saddle-point evaluation: 



M'* - -4i£l wxw , C A - -Mfj" A l lk]x[k] . (9.54) 

Indeed, the scaling (9.35) implies that the remaining variables denoted M' A and ( A in (9.33a) are 
sub leading compared to £ A and fj A by a factor of iV -1 / 8 . There remain the modes v A and u A , which are 
distinct from the others in that they carry an SU(iV) index u. From their coupling to Xa i n Eq. (9.9), 
one sees that each u A u B pair in an insertion, for a fixed, un-summed value of the index u, costs a factor 
of iV 1 / 2 ; however, summing on u (as required by gauge invariance) then turns this iV" 1 / 2 suppression 
into an iV 1 / 2 enhancement. In other words, v A and v A factors in the insertions should each be thought 
of as being enhanced by iV 1//4 . The large- N rule of thumb for choosing Grassmann collective coordinates 
in correlators is now clear: the insertions should saturate as many of the {u A ,u A } integrals as possible, 
with the proviso that the integrals over the {£ A ,fj A } must be saturated. 

Notice in order to consider insertions of {u A ,u A } and {IC,IC} we must return to return to the 
collective coordinates before these variables have been integrated out. We take the M = 4 case first 
and show, to leading order in 1/N, the gauge invariant combination v A v B in the effective integration 
measure (9.41) is replaced by 

v A v B - -^S^n o l W x W , (9-55) 

71 

where the iV 1//2 dependence has already been noted. To this end, consider a general insertion with a 
string of such combinations v Al v Bl ® • • • <g> u A Pu Bp . We must insert this expression into the measure 
before the v integrals have been performed. Then performing the v integrals as in (9.9) in the presence of 
the insertion leads to a modified expression involving factors of x' 1 which can be derived by considering 

9 „ „ | ,AT-2fc 

det 4fc x 



dXA lBl dXA p B p l (9.56) 

= A^det^xl^V'Wi ® • • • ® {x- 1 )b p a p + ■■■ , 

where t (transpose) acts on instanton indices and the ellipsis represent terms of lower order in 1/N. This 
shows that, after performing the v integrals, a term of the form v A v B is replaced by — {\f~N / \[%tx){x~ X )ba, 
to leading order. Now we replace x with its saddle-point value (9.16) and use (9.5) to give (9.55). The 
replacement elucidates the mysterious appearance of the variables Cl a . 

In the M = 2 theory, examination of Eq. (9.2) yields a similar replacement 



*V - ^^V*1 IWI . (9.57) 



7T 



This leaves the matter Grassmann collective coordinates {/C, /C}. Again after examining (9.2), for each 
flavour / = 1, . . . , 2N, we have the replacement 

JCfK, — ^|e-*% x[fe] . (9.58) 
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We now consider some examples. First of all, in the M = 4 theory there are a class of correlation 
functions of type described above motivated by the AdS/CFT correspondence (discussed in §IX.3). The 
simplest involves the 16-point correlation function of the fermionic composite operator 

A A = g 2 <j mn tr N F rnn \ A . (9.59) 

Since, to leading order in g, each insertion is linear in Grassmann collective coordinates, we only need the 
dependence on the collective coordinates {£ , fj }, since the other Grassmann coordinates must be lifted 
by the instanton effective action. In this case to find the dependence, we can use the supersymmetric 
sweeping out procedure defined in §IV.3 with the x-dependent variation parameter (4.44) giving 68 

A A = ig 5/2 a mn a k £ A (x) ti N F mn F kl + •••. = -\g h/2 {; A {x) tr N F 2 m + • • • . (9.60) 

The ellipsis represent the dependence on the other Grassmann collective coordinates that are not needed. 
Next we can use the identity (C.6) and finally insert the saddle-point expression for / in Eq. (9.53): 

A A = 2Aikg l l 2 i A {x) P - j + • • • . (9.61) 

{(x-xy + p*) 4 

Then using Eqs. (9.41) and (9.61), the large-A" /c-instanton contribution to the 16-point correlator is 
precisely [21] 



k=l 



X 



(9.62) 



For application to the AdS/CFT correspondence it is unnecessary to perform the remaining integrals. 

The correlator (9.62) has the distinguishing property that the functional dependence on the insertion 
points given by the integral term can be reproduced by a single instanton calculation with gauge 
group SU(2). The reason is clear: only the dependence of the insertions on the supersymmetric and 
superconformal Grassmann coordinates was required. In particular the saddle-point evaluation of the 
relevant part of the insertion (9.61) is identical to the single instanton in SU(2). So at large N the 
dependence on k and N only appears in an overall multiplicative factor. In fact there is a whole family 
of related correlators considered in Ref. [117] for which the same property holds and a calculation for a 
single instanton in SU(2) is sufficient to obtain the functional dependence on the insertion points. We 
have now explained the puzzle of why the analysis of Ref. [117] involving a single instanton in SU(2) 
was able to capture effects that ultimately, via the AdS/CFT correspondence should have been valid 
at large- N only. 

There are more general correlation functions involving insertions with non-trivial SO (6) ^-transformation 
properties for which a calculation in SU(2) would not suffice. For example consider the insertion of the 
composite field 

Og.-aM = tr N [<j> ai ■ ■ ■ <p ap F 2 mn ] , (9-63) 

68 The extra factor of g 1 / 2 follows form the re-scaling (4.42). 
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whose form is motivated by the AdS/CFT correspondence. We now consider this evaluated at the saddle 
point. In general, O depends on the Grassmann variables {/i A ,p, A }, as well as the supersymmetric and 
superconformal variables. The resulting expressions are rather cumbersome so we will assume that the 
integrals over the supersymmetric and superconformal variables are saturated by other insertions, for 
example, by sixteen insertions of A A . In this case we only need the dependence of O on the {u A ,u A }: 

[(x - xy + p 2 ) 

At leading order in 1/N we need use the replacement (9.55) to obtain 

o ai ... ap (x) ~ kN^- — -£- — _ - 4 n ai ...n ap . (9.65) 
[{x - xy + p 2 ) 

In this case, we see that the insertion depends explicitly on the S 5 coordinate Cl a . 

In the M = 2 theory, a simple correlator involves 8 insertions of A A , as in (9.59) where now A = 1, 2, 
and 4 insertions of the gauge-invariant operator 

Q f {x) = g 2 tr NX fXf , (9-66) 

which is quadratic in the matter Grassmann collective coordinates. Evaluating this on the saddle-point 
solution using the expression (6.53) along with the leading-order replacement (9.58), we have 

Q ' {x) = ""ma v^x ^ + • • ' ' (9 ' 67) 

7rViv ((x - X) 2 + p 2 ) 

Notice that the 4 insertions of Qf involve a factor of e~ 41 ^ which cancels the factor of e 4 ^ in Eq. (9.51), 
so the integral over yields the constant 2ir. Hence, 

<A5(*f>) x ... x A^>)Q fl{x <<>>) x ... x 4^ { £ jW* } 

(9 68) 

i p 5 H B((xV)-xy + p 2 ) 4 fi 



f=i - x) 2 + pa) 4 f = -j ((x(w) - xy + p 2 )' 



IX. 3 Instantons and the AdS/CFT correspondence 

The formalism that we have used to calculate the instanton contributions to certain correlation functions 
at leading order in 1/N is particularly interesting in light of the AdS/CFT correspondence. The 
basic example of the AdS/CFT correspondence, due to Maldacena [112] (see also the comprehensive 
review [113]), states that the Af = 4 supersymmetric gauge theory is equivalent to Type IIB string 
theory on an AdS 5 x S 5 background. In particular, the gauge coupling g and vacuum angle 9 of the 
four-dimensional theory are given in terms of the string parameters by 

g = ^Ang st = VAire* , 9 = 2nC (0) . (9.69) 
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Here, g st is the string coupling while and are the expectation values of the dilaton and Ramond- 
Ramond scalar, respectively, of Type IIB string theory. Also N appears explicitly, through the relation 

^ = (9.70) 

a 

where (a')" 1 is the string tension and L is the radius of both the AdS 5 and S 5 factors of the background. 



IX. 3.1 The instanton collective coordinate integral 



Even before we consider application of the AdS/CFT correspondence to correlation functions, and how 
it applies to instantons, we can already see in the form of the effective large-iV collective coordinate 
integral (9.41) a remarkable relation to AdS 5 x S* 5 . The c-number integrals are precisely the volume 
form on this space, where the AdS$ part has the metric 

ds 2 = p- 2 (dXl + dp 2 ) (9.71) 

so that p is the radial variable. This fact alone strongly suggests that Yang-Mills instantons are identified 
in the large- N limit with D-instantons in the Type IIB string theory. Some relevant aspects of D-branes 
in Type II string theory are summarized in §X.3. In particular, the contribution of k D-instantons to 
low energy correlators is governed by the U(fc) matrix integral (10.52). For the most part, the SU(fc) 
part of the integral can be factored off as an overall numerical constant; hence, 



e -27rfc(e-'*+iC*( )) 

VolU(fc) 



J d w Xd w e ■ Z k , (9.72) 



where is the same SU(/c) matrix integral that we defined in (9.49) and Xm and are the abelian 
components of the fields. This D-instanton collective coordinate integral is appropriate to the case of 
flat ten-dimensional space. However, the results of Banks and Green [118] imply that since AdS 5 x S 5 is 
conformally flat a similar expression should apply to this background with the bosonic abelian integrals 
replaced by the appropriate volume form of AdS$ x S* 5 . Being careful with the pre-factor, one has the 
D-instanton collective coordinate integral [21,117] 



AdS 5 xS 5 



{ar\ke^)- 7 l 2 e- 2 ^-^ iC{ ^Y,]p f ^^d^d^Q. (9.73) 



Using the relation between the couplings of the string theory and the gauge theory in (9.69) this is 
precisely the leading order collective coordinate integral in the large- N limit of the M = 4 instanton 
calculus (9. 41). 69 This equivalence of the large- N Yang-Mills instanton measure and the D-instanton 
measure on AdS 5 x S 5 is rather stunning evidence for the AdS/CFT correspondence. It is very satisfying 
that instantons at large iV seem to probe the AdS$ x S 5 geometry directly. Although, as discussed below, 
this relation implies the existence of an as yet un-proved non-renormalization theorem for a certain class 

There appears to be a mismatch of g s , however, this is due to our normalization of the Grassmann collective co- 
ordinates. Re-scaling the supersymmetric and superconformal collective coordinates, £ A and fj A , by g 1 ^ 2 produces the 
missing factor of g 8 . 
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of correlation functions (like those discussed in §IX.2) protecting them against perturbative corrections 
in g 2 N . In what follows, we shall make this correspondence between large- N instanton and D-instanton 
effects even more convincing by considering these correlation functions. 

In §X.3 we will establish a relation between Yang-Mills instantons and D-instantons which is ap- 
parently rather different from that we have just described. We will show in §X.3 how k D-instantons 
in the presence of N D3-branes are precisely identified with k Yang-Mills instantons in U(iV) gauge 
theory describing the collective dynamics of the D3-branes. This is true in the limit a' — > for fixed 
g st (in other words fixed coupling g on the D3-branes) in which bulk supergravity modes decouple 
from the world-volume theory of the D3-branes. The question is how this description in terms of D- 
instantons moving in the background of D3-branes relates to the description of D-instantons moving in 
the AdS$ x S 5 background (but with no D3-branes) established above? The answer is interesting be- 
cause it illuminates some basic features of the AdS/CFT correspondence. In the limit of large N (with 
g 2 N large and small g) the background of D3-branes is replaced by its near-horizon geometry, namely 
AdS§ x S 5 . In a certain sense the D3-branes disappear to be replaced by non-trivial geometry. We can 
see this happening explicitly with the D-instanton/D3-brane system. The presence of the D3-branes in 
the D-instanton matrix theory is signaled by the fundamental hypermultiplet variables {w&,f/,}, and 
their conjugates, describing open strings stretched between the D-instantons and D3-branes. At large 
N, these degrees-of- freedom can be integrated out in the way described in §IX.l to yield an effective 
collective coordinate integral (9.38) which exhibits the AdS$ x S 5 geometry explicitly. So the analogue 
of taking the near horizon geometry is the process of integrating out the degrees-of-freedom of open 
strings stretched between the D-instantons and D3-branes and taking the large- N limit. The remaining 
puzzle is that this should be done at weak coupling, i.e. small g 2 N, whereas the dual supergravity re- 
gion of the AdS / CFT correspondence is valid at large g 2 N. Yet again, this strongly suggests that some 
non-renormalization theorem in g 2 N is a work: something we shall comment on in the next section. 

IX. 3. 2 Correlation functions 

A more precise statement of the AdS/CFT correspondence was presented in Refs. [119,120]. The M = 4 
gauge theory is to be thought of living on the four-dimensional boundary of AdS§. In particular, each 
chiral primary operator O in the boundary conformal field theory is identified with a particular Kaluza- 
Klein mode of the supergravity fields which we denote as $e>. In general it is not known how to solve 
string theory on an AdS background. However, the AdS/CFT correspondence is still useful because 
in a certain limit we can approximate the full string theory by its supergravity low-energy limit. This 
requires weak coupling (small g) but, in addition, the length scale L must be large compared with the 
string length scale \fa! '. This latter requirement is met when the 't Hooft coupling g 2 N of the gauge 
theory is large and conventional perturbation theory breaks down. In this limit [119,120], the generating 
function for the correlation functions of O is then given in terms of the supergravity action Shb^o] 
according to 




(9.74) 
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The IIB action on the right-hand side of the equation is evaluated on a configuration which solves the 
classical field equations subject to the condition $o(x) = Jo(x) on the four-dimensional boundary. 



In most applications considered so far, the relation (9.74) has primarily been applied at the level of 
classical supergravity, which corresponds to N — > oo, with g 2 N fixed and large, in the boundary theory 
[119-132]. However, the full equivalence of IIB superstrings on AdS$ x S 5 and M = 4 supersymmetric 
Yang-Mills theory conjectured in [112] suggests that (9.74) should hold more generally, with quantum 
and stringy corrections to the classical supergravity action corresponding to g 2 and 1 / (g 2 N) corrections 
in the J\f = 4 theory, respectively. The particular comparison that concerns us here is between Yang- 
Mills instanton contributions to the correlators of O generated by the left-hand side of (9.74) and 
D-instanton corrections to the IIB effective action on the right-hand side. 

Let us describe these D-instanton effects in more detail. Before we look at some specific correlators, 
let us consider in more detail the effects of D-instantons in the string theory [118] (closely following the 
more detailed treatment in [117]). In [133], Green and Gutperle conjectured an exact form for certain 
non-perturbative (in g st ) corrections to certain terms in the Type IIB supergravity effective action. 
In the present application, where the string theory is compactified on AdS$ x S 5 , it is important for 
the overall consistency of the Banks-Green prediction that the non-perturbative terms in the effective 
action do not alter the AdS 5 x S* 5 background, since the latter is conformally flat [118]. In particular, 
at leading order beyond the Einstein-Hilbert term in the derivative expansion, the IIB effective action 
is expected to contain a totally antisymmetric 16-dilatino effective vertex of the form [134, 135] 



Here A is a complex chiral SO(9, 1) spinor, and fi$ is a certain weight (12, —12) modular form under 
SL(2, Z). At the same order in the derivative expansion there are other terms related to (9.75) by 
supersymmetry and involving other modular forms [134-136] and, in particular, f n (r,f), with n = 4 
and 8. The modular symmetry is precisely ^-duality of the Type IIB superstring, and although this 
does not completely determine the modular forms /„, for the n = 4 term, Green and Gutperle [133] 
made the following conjecture, later proved in Ref. [137] and generalized to n ^ 4 in [135, 136]: 



(p,<z)^(o,o) 

These rather arcane expressions turn out to have the right modular properties, i.e. weight (n— 4, — n+4), 
and also have very suggestive weak coupling expansions [133-135,138]: 




(9.75) 





(9.76) 



2vr 2 



oc 



e~*' 2 f n = 327T 2 C(3)^ 4 



3(9-2n)(7-2n) 



'k,n i 



(9.77) 



k=i 



where 




(9.78) 
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and the numerical coefficients are 

(-l) n ^87rr(3/2)r(r- 1/2) 
C "' r ~ 2T(r -n+ l)T(n + 3/2)T(-r - 1/2) 

As previously, the summation over d in (9.78) runs over the integral divisors of k. Notice that, having 
taken into account the conjectured correspondence (9.69) to the couplings of four-dimensional Yang- 
Mills theory, the series (9.77) has the structure of a semi-classical expansion: the first two terms 
correspond to the tree and one-loop pieces, while the sum on k is interpretable as a sum on Yang- 
Mills instanton number, the first and second terms in the square bracket being instantons and anti- 
instantons, respectively (as dictated by the 6 dependence). Each of these terms includes a perturbative 
expansion around the instantons, although notice that the leading order anti-instanton contributions 
are suppressed by a factor of g 4 ™ -16 (so not suppressed for n — 4) relative to the leading order instanton 
contributions. 



(9.79) 



Let us focus on the leading semi-classical contributions to the f n ; by this we mean, for each value of 
the topological number k, the leading-order contribution in g 2 . For and fs the leading semi-classical 
contributions come from instantons only and have the form 

n-7/2 



H2 fn = const- (AT e ^ T Ei' ( 9 - 8 °) 

fc-instanton \9 J ^ « 



d\k 

neglecting g 2 corrections. For the special case of f± there is an identical anti-instanton contribution 
with it — > —if. Comparing with (9.73), we see that the terms in the square bracket in (9.78), which 
are non-perturbative in the string coupling, are interpreted as being due to D-instantons. 

From the effective vertex (9.75) one can construct Green's functions Gi 6 (x^\ . . . ,x^) for sixteen 
dilatinos A(x^), 1 < I < 16, which live on the boundary of AdS 5 : 

G 16 = (A(x«) x ... x A(x^)) ~ (a')' 1 e-*/ 2 f ie t 16 [ f[ K? /2 (X, p; x«, 0) (9.81) 

J P i=i 

suppressing spinor indices. Here Ky 2 is the bulk-to-boundary propagator for a spin-| Dirac fermion of 
mass m = — \L~ l and scaling dimension A = | [119, 120, 123, 139]: 

K( /2 (X, p; x, 0) = K 4 (X, p; x, 0) (p 1/2 75 - p' 1/2 (x - X) nl n ) (9.82) 

with 

K 4 (X, p; x, 0) = / 4 • (9.83) 

[p 2 + {x — xy) 

In these expressions the are four- dimensional spacetime coordinates for the boundary of AdS 5 while 
p is the fifth, radial, coordinate. The quantity tie in Eq. (9.81) is (in the notation of Ref. [117]) a 
16-index antisymmetric invariant tensor which enforces Fermi statistics and ensures, inter alia, that 
precisely 8 factors of p^ 2 75 and 8 factors of p~ 1 l 2 ^ n are picked out in the product over Ky 2 . It is not 
difficult to see that the form of (9.81) matches precisely the large-iV instanton contribution to the 16- 
point correlator in the gauge theory (9.62). In particular, the dilatino corresponds, on the field theory 
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side, to the fermionic composite operator defined in (9.59) [117]. This shows that not only does the 
form of the D-instanton collective coordinate integral match the large- N instanton collective coordinate 
integral, but, in addition, the various bulk-to-boundary propagators on the supergravity side arise in 
the field theory as operator insertions evaluated in the large-iV instanton background. In the case of 
Giq, and related correlators, because of the nature of the saddle-point solution, the insertions have the 
form of k times the same insertion in the theory with gauge group SU(2) at the one-instanton level 
(precisely the case considered in [117]). 

Of course this equivalence between D-instanton and Yang-Mills instanton effects in correlation func- 
tions extends to the other cases, G 8 and G 4 , related by supersymmetry. Note that there is no explicit 
dependence in these expressions on the coordinates on S 5 ; in particular, the propagator does not depend 
on them. This is because G 16 , G$ and G 4 are correlators of operators whose supergravity associates are 
constant on S 5 . Generalizations involving non-trivial dependence on S 5 are possible [21]. 

As emphasized above, the comparison between the Yang-Mills and supergravity descriptions can 
be quantitative if and only if there exists a non-renormalization theorem that allows one to relate 
the small g 2 N to the large g 2 N behaviour of chiral correlators such as G n , as has been suggested in 
Refs. [140]. In the absence of such a theorem the best one can hope for is that qualitative features of 
the agreement persist beyond leading order while the exact numerical factor in each instanton sector 
does not, in analogy with the mismatch in the numerical pre-factor between weak and strong coupling 
results for black- hole entropy [120]. In our view, however, our results provide strong evidence in favor 
of such a non-renormalization theorem for the correlators G n , for the following reason. Consider the 
planar diagram corrections to the leading semi-classical (i.e. g 2 N — > 0) result for, say, G\q, Eq. (9.62). 
In principle, these would not only modify the above result by an infinite series in g 2 N, but also, at 
each order in this expansion, and independently for each value of k, they would produce a different 
function of spacetime. The fact that the leading semi-classical form for G±q that we obtain is not 
only /^-independent, but already reproduces the spacetime dependence of the D-instanton/supergravity 
prediction exactly, strongly suggests that such diagrammatic corrections (planar and otherwise) must 
vanish. (Note there are necessary subleading corrections, both in 1/N and in g 2 , to our leading semi- 
classical results.) By evaluating the possible form of non-perturbative corrections on the string theory 
side of the correspondence, it has been argued that the non-renormalization theorem is rather natural 
[141]; however, there is still no purely field theoretic proof. 

It is interesting to generalize the relation of large- N instantons in other conformal gauge theories 
to the D-instantons via the more general AdS / CFT correspondences. This has been done for the finite 
Af = 2 theories with product gauge group SU(N) k in [111], the Af = 4 Sp(iV) and SO(iV) theories 
in [108] and for the finite N = 2 Sp(iV) theories in [109, 110]. 

The finite Af = 2 theory with gauge group SU(iV) and 2N fundamental hypermultiplets has not yet 
been discussed in this respect. However, it is not difficult to find a set-up which can be used to discuss 
this case. Here we sketch the details. The idea is to consider the finite Af = 2 theory with product 
gauge group SU(iV) x SU(iV) and matter in bi-fundamental representations of the gauge group. This 
theory can be realized, as explained in §X.2.5, on the world- volume of N D3-branes lying transverse to 
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the orbifold IR 2 x (R 4 /Z 2 ), where Z 2 acts by inversion. The near horizon geometry of the D3-branes is 
AdS$ x S' 5 /Z2, where Z 2 acts on S 5 , realized as x 2 a = 1 by taking x 3 , x 4 , x 5 , x 6 — > —x 3 , —x 4 , — x 5 , — x 6 , 
leaving x 1 and x 2 fixed. So the AdS/CFT correspondence in this case involves Type IIB string theory 
on the AdS$ x S 5 /7*2 background. Notice that the Z 2 leaves an S* 1 C S 5 fixed and so in this case the 
dual geometry is not smooth. Now consider instanton effects in one of the SU(iV) gauge group factors. 
At leading order in such an instanton background, the adjoint- valued fields of the other gauge remain 
zero and so the instanton effects in question are identical to the SU(iV) theory with 2N hypermultiplets 
(the latter arising from the bi-fundamental hypermultiplets of the original theory). On the string theory 
side, these instanton effects of charge k will be related to k fractional D-instantons. These fractional D- 
instantons are to be though of as D-strings which are wrapped around the non-trivial S 2 of the smooth 
resolution of the orbifold IR 4 /Z 2 in the limit that the S 2 cycle collapses. In the near-horizon geometry 
these fractional instantons are consequently stuck at the orbifold singularity of S 1 C S 5 /Z 2 . So the 
fractional D-instanton collective coordinate integral involves an integral over the six dimensional space 
AdS§ x S* 1 rather than ten-dimensions. Furthermore, this measure includes, in analogy with (9.72), 
a partition function of six-dimensional SU(/c) gauge theory dimensionally reduced to zero dimensions. 
This is precisely what is seen in the large- N instanton collective coordinate integral (9.51). It would be 
interesting to extend the analysis to particular correlation functions in order to place the relationship 
on the same footing as the Af = 4 case. 
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X Instantons as Solitons in Higher Dimensions and String 
Theory 



Instantons are classical solutions of Euclidean gauge theories in four-dimensional spacetime with finite 
action. They are consequently localized in spacetime rather than space. However, it will turn out to be 
rewarding to think about instantons in gauge theories in dimensions greater than four. In these other 
contexts, if the higher-dimensional theories are defined in Minkowski space then the instanton solutions 
have an interpretation as solitons. For example, consider five-dimensional gauge theories. We can 
easily take an instanton solution of four-dimensional gauge theory and embed it in the five- dimensional 
theory, by identifying the coordinates x n , n = 1,2,3,4, on which the instanton solution depends, with 
the Euclidean subspace of the five-dimensional Minkowski space 70 coordinates y N , N — 0, 1, 2, 3, 4, with 
y n = x n , n = 1,2,3,4. The resulting solution of the five-dimensional Yang-Mills equations is then 
independent of the time coordinate y°. The solution, which had finite action in four dimensions, now 
has finite energy in five dimensions and so is interpreted as a lump localized in space but evolving 
in time. In other words, the instanton is a particle-like soliton in five dimensions. Adding an extra 
dimension yields a much richer system: instantons in four-dimensional gauge theories have no dynamics, 
but when lifted to five dimensions they can move in the four dimensional space and have complicated 
dynamics. 

We can continue this process of embedding instantons in gauge theories of even higher dimension. In 
six dimensions, the embedded instanton solutions are independent of two coordinates and so, thinking 
of one of them as a time direction, means the solution is extended in a single space direction; in 
other words, it is a string-like "defect". In seven dimensions, one gets a membrane and so on. Let us 
introduce some modern terminology, arising originally in the context of supergravity and string theory, 
which affords us a certain brevity in talking about these more exotic possibilities. Consider gauge 
theories in D-dimensional Minkowski space. An instanton embedded in this theory will be independent 
of D — 4 coordinates (one being the time). This means that the solution will be extended in D — 5 
spatial dimensions. Such a configuration is called a U (D — 5)-brane" . The case with D — 4, describes 
a solution which is localized in both space and time dimensions and this is precisely the situation for 
the original instanton of four- dimensional gauge theory. This case is therefore a "(— l)-brane" . For 
D = 5, the solution is localized in space, but not time, and so the instanton lump-like soliton of five- 
dimensional gauge theory is a 0-brane. It may seem strange to think about instanton solutions in higher 
dimensional gauge theories but one of the major lessons of string theory and its generalizations is that 
it is compulsory to think in arbitrary numbers of dimensions and there are big advantages in doing so. 

We will show how the instanton moduli space plays an important role in describing the dynamics of 
instanton-branes in higher-dimensional gauge theories. In §X.1.1 we begin by discussing the case of the 
pure gauge instanton. Supersymmetric generalizations will be considered in §X.2. It is worth pointing 
out that much of the analysis for instantons is very closely related to the problem of describing the 
semi-classical behaviour of monopoles in supersymmetric gauge theories described in Refs. [142-144], 
and references therein. In particular the description of instanton lumps in five- dimensional gauge 

70 In Minkowski space we choose a metric t]mn = diag(— 1, !,...,!). 
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theory with either 4 or 8 supercharges (corresponding to Af = 2 and Af = 4 supersymmetry in four 
dimensions) is closely related to the description of monopoles in four-dimensional gauge theories, with 
Af = 2 and Af = 4 supersymmetry, respectively, since both involve a quantum mechanical a-model 
on the appropriate moduli space. In both cases, the instanton and monopole moduli space are both 
hyper-Kahler spaces and so the structure of the cr-models is identical. The main difference is that the 
cx-model in the instanton case admits a straightforward linear realization via the ADHM hyper-Kahler 
quotient construction while in the monopole case the analogous quotient construction based on Nahm's 
equations is somewhat more complicated. 

After we have described the dynamics of instanton branes, we will, in §X.3 describe how such 
objects appear naturally in string theory. In this context they correspond to D-branes dissolved within 
other higher-dimensional D-branes. We will show how solving for the low energy dynamics of these 
configurations of D-branes actually leads directly to the instanton calculus. Not only is the ADHM 
construction obtained directly, but also the leading order expression for the collective coordinate integral 
for the Af = 4 and Af = 2 supersymmetric theories. We also briefly describe how the actual profile of 
the instanton can be obtained by using a suitable probe brane. 

X.l Non-supersymmetric instanton branes 

We begin with the case of pure gauge theory. The key idea, which originated with Manton [145] and 
was elaborated by Ward in the context of a 2 + 1-dimensional model [146], is that the dynamics of 
slowly moving solitons can be approximated by assuming that the evolution is adiabatic in the moduli 
space of classical solutions. In many cases involving lumps the time evolution of the system is governed 
by the geodesic motion on the moduli space with respect to the Levi-Civita connection induced by the 
metric that arises as the inner-product of the zero modes (2.32). The intuitive idea that lies behind 
this approximation is that, for sufficiently low velocities, the non-zero modes of the fields are only very 
weakly excited and therefore it is consistent to ignore these modes to leading order. 

In our case we start with D > 4-dimensional gauge theory in Minkowski space with coordinates y M , 
M — 0,1, . . . , D — 1 having the conventional action 71 



Here, go is the gauge coupling in D dimensions which has dimensions of [L]^ D A ^ 2 . 

We now consider how to embed an instanton in the D-dimensional theory. To this end, we decom- 
pose D-dimensional Minkowski space into D — 4-dimensional Minkowski space and four- dimensional 
Euclidean space, by taking y M = (£ a , x m ), a — 0, ... ,p, where p = D — 5, and m — 1, 2, 3, 4. We now 
embed the instanton solution in the subspace x m by taking 



71 In this and following sections, we will choose a normalization for the fields where the coupling constant appears 
outside the action. So the field strength, for instance, has no factor of g in front of the commutator: Fmn = Qm An — 
d N A M + [A M , A N ]. 




(10.1) 




(10.2) 
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where A m (x;X) is the instanton solution constructed in §11 (with coupling g set to 1). Recall that 
denote the collective coordinates of OTT^. This ansatz obviously satisfies the equations-of- motion of the 
D-dimensional theory and represents an object extended in p = D — 5-space dimensions, i.e. a p-brane. 



X.l.l The moduli space approximation 

Manton's moduli space approximation amounts to modelling the dynamics of the p-brane by allowing 
dependence on the transverse coordinates £ a (so including time £°) to enter implicitly through the 
collective coordinates. Of course, the original classical solution with ^"-dependent collective coordinates 
A N (0, x; X(£)) does not satisfy the classical equations-of-motion. The idea is that for sufficiently slowly 
varying X^(£) it is almost a solution. The point can be illustrated with the position coordinates of 
the centre of the instanton X n . In this case we know the exact solution for motion in time t = £° 
with constant velocity because we can Lorentz-boost the static solution. The boosted solution only 
approximates An(0,x; X n = v n t) if the velocity |i>| <C 1. 

The moduli space approximation is an expansion in powers of £ a derivatives. In order to find 
the leading order effective dynamics we have to substitute the instanton solution with ^"-dependent 
collective coordinates into the action. This then yields an effective action for the collective coordinates 
now interpreted as fields on the p + 1-dimensional world- volume of the p-brane: X M (£). In order 
that the effective action is at least quadratic order in the £ a derivatives, we must ensure that the 
equations-of-motion are satisfied to linear order in the £ a derivatives. In components (A a ,A n ), the 
equations-of-motion, V N F MN = 0, are 

+ V a {d a A n - V n A a ) = , (10.3a) 
V n iV n A a - d a A n ) + V b F ab = . (10.3b) 

Now we substitute in the ^-dependent instanton solution A n = A n (x; X (£)) . To linear order in £ a we 
can ignore the second term in (10.3a) and the second term in (10.3b); this leaves 

(|^^-2Vg=0, (10.4) 

for A a . Notice that this equation is very similar to the background gauge condition for an instanton 
zero mode (2.31). Indeed, if we take 72 

A a = Q, jX d a X lx , (10.5) 

where is the compensating gauge transformation associated to the collective coordinate X^, then 
(10.4) is satisfied by virtue of (2.31). Since A a is linear in <9 a X M , it is then trivial to see that the ansatz 

a n = (n,d a x^), Mx; x(£)j) , (10.6) 

satisfies the equations-of-motion (10.3a)-(10.3b) to linear order in ^-derivatives. In making this ansatz, 
we have fixed the gauge symmetry of the theory in a rather unconventional way. We are working in the 

72 The following discussion and its supersymmetric generalization is motivated by the treatment of the analogous 
monopole problem in Rcfs. [142,144]. 
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gauge where A = f2 M <9 X M . Of course we could always return to a more familiar gauge, say A = 0, by 
performing a gauge transformation on the ansatz. 

Since the ^"-dependent expression (10.6) is only an solution of the equations-of-motion to linear 
order in £ a derivatives, it will now contribute non-trivially to the action at quadratic order: 



S (2) = 4r f d p+ ^d 4 xtT N 5,A n (x;X(0)d a X^OS I/ A n (x;X(0)d a X"(0 

(10.7) 

d? +l tig, v {X)d a X»d a X v , 



9b 



1 

where g^ v (X) is the metric tensor on the instanton moduli space defined in (2.32). 

The effective collective coordinate dynamics embodied in (10.7) has the form of a cr-model in p + 1- 
dimensions whose target space is the moduli space 9Jlk- The classical collective coordinate dynamics 
follows from the equation-of-motion 



d a d a X» + l g ^^-d a X p d a X a = . (10.8) 



For the case with D = 5, where the instanton is a lump in five dimensions, this is simply the equation 
for geodesies in 9Jt fc . 



X.2 Supersymmetric instanton branes 

In this section, we will investigate how the description of instanton-branes extends to the supersym- 
metric theories. Necessarily the theories must have extended supersymmetry from the four- dimensional 
perspective, because the Af = 1 theory cannot be obtained by dimensional reduction from higher di- 
mensions. Since theories with the same number of supercharges are related by dimensional reduction, 
we loose no loss of generality by considering the theories in their maximal dimension: D = 10 for the 
theory having 16 supersymmetries (Af = 4 in D = 4) and D = 6 for the theory having 8 supersymme- 
tries (Af = 2 in D = 4). In the last section we have shown how the moduli space approximation involves 
a p + 1 = D — 4-dimensional a-model with 9R k as target. In the supersymmetric case, we expect the 
moduli space approximation to lead to a supersymmetric a-model with the appropriate number of su- 
persymmetries. Since an instanton breaks half the supersymmetries of the parent theory (see §IV.2.3), 
we expect this cr-model to manifest half the supersymmetries; i.e. have 8 supercharges, in the p = 5 
case, and 4 supercharges in the p = 1 case. Furthermore, the supersymmetry in both cases is chiral, 
being M = (0, 1) in six spacetime dimensions and Af = (0, 4) in two spacetime dimensions, respectively. 
It is a well-established fact that a-models with the relevant kinds of supersymmetry only exist in these 
dimensions if the target space is hyper-kahler [147-149]. In some respects, the very fact that instantons 
can be embedded as branes in the higher-dimensional supersymmetric gauge theories "proves" that the 
instanton moduli space DJlk has to be hyper-Kahler. 

What is interesting about these a-models is that on Wick rotation and complete dimensional reduc- 
tion to zero dimensions their partition functions reproduce the leading-order semi-classical expression 
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for the collective coordinate integral of the instanton calculus. This includes both the volume form on 
the instanton moduli space as well as the non-trivial instanton effective action. In fact the a-models 
have both a non-linear realization, where the target space is directly DJlk, but also, importantly, a linear 
realization. In this second formulation there is an auxiliary supermultiplet involving a non-dynamical 
U(/c) gauge field. It turns out that this linear formulation is directly related to the ADHM construc- 
tion, where the bosonic and fermionic ADHM constraints arise from integrating out Lagrange multiplier 
fields. However, there is more. When one dimensionally reduces these cr-models there are certain kinds 
of deformation which can be added in form of very special potentials [143, 150, 151]. These potentials 
naturally arise in the instanton calculus when one moves onto the Coulomb branch of the original theory 
and correspond to the adjoint- valued VEVs described in §V.2. In the 2-dimensional cr-model example, 
we also have the freedom to add other fields to the cr-model which correspond in the M = 2 instanton 
calculus to the Grassmann collective coordinates of hypermultiplet matter fields. We will subsequently 
show how these cr-models arise very naturally in the context of D-branes in string theory. 



X.2.1 Action, supersymmetry and equations-of- motion 

The actions of the D = 6 and D — 10 theories with 8 and 16 supercharges, respectively, can both be 
written as 



S= ^J d D yti N {\F MN F MN -i^T M V M ^ . (10.9) 
The theory is invariant under the supersymmetry transformations 

5A N = -ZT N y, (10.10a) 
5^ = iT MN EF MN . (10.10b) 

As in the purely bosonic case, y M = (t; a ,x m ), where £ a , a = 0,1, ... ,p, are p + 1 = D — 4-dimensional 
Minkowski space coordinates and x m , m = 1,2,3,4, are four-dimensional Euclidean coordinates. 

The spectrum of fields in both case consists of a gauge field A N = (A a , A n ), with a = 0, 1 in D = 6 
and a — 0, 1, . . . , 5 in D — 10, and the minimal spinor ^. (Our conventions for spinors are described in 
Appendix A.) In D = 6 this is a Weyl, while in D = 10 it is a Majorana-Weyl, spinor. In both cases, 
it is convenient to represent the T-matrices using a tensor product notation that reflects the subgroups 
of the Lorentz group, SO(4) x SO(l, 1) and SO (4) x SO(5, 1), respectively: 

Ttv = {r a ® 7 5,l®7n} , (10.11) 

where 7„ are the SO(4) 7-matrices, (A. 7), and T a are the SO(l, 1) and SO(5, 1) T-matrices, in the two 
cases, respectively. In both cases we write 

r. = (l E ") , (10.12) 



where, for SO(l, 1), S a = (1, 1) and S a = (—1, 1) and for SO(5, 1) the S-matrices are defined in (A. 19). 
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In both D = 6 and 10, in the tensor product notation (10.11), a Weyl spinor can be written 

* = © ® (J) A ° + (?) 8 (?) A " • < 10 - 13 ' 

where the a and ct are four- dimensional Euclidean space spinor indices. In the D = 6 case, ^ is a 
pseudo-real spinor and Aq and \ a are independent complex quantities. It is convenient in this case, 
to introduce the notion of a two-component symplectic real spinor (see Appendix A). In terms of the 
component spinors, we define X A and X% A = 1, 2, via 

Ai = ^(A /J ) t , A^ = ^A a , Ai=^A°, A* = -^c*»(A*)t . (10.14) 

The reason for choosing these definitions will emerge shortly. To accompany the two-component spinors 
we also define the S-matrices 

Ef* = Y, a e AB , Y, aAB = S a e AB . (10.15) 
By virtue of (10.14), the two-component spinors satisfy the pseudo-reality conditions 

(A A )t = T? AB X B , (A A )t = Y? AB X B . (10.16) 



In the D = 10 case, \1/ is subjected to a Majorana spinor condition, which means in terms of the 
component spinors, \ A and \a, A = 1,2,3,4, that 

(A A )t = fP AB \ B , (A A )t = Y? AB \ B . (10.17) 

Notice that this is identical in notion to the reality condition for the symplectic real spinor in D = 6 
case (10.16) (and explains our choice of definition). 

In terms of the spinors A and A, the action (10.9) is 

S = ^J dDytTN \\ F mnF mn + 2V n \ A a n \ A - i\ A Z aAB V a \ B - i\ A T l c AB T) a \ B j- . (10.18) 

Our definitions have been chosen so that for ^-independent field configurations, the Lagrangian density 
is identical to minus that in (4.13) with the relation A a = i<f) a and with the metric on the a-indices 
changed from a Euclidean to Minkowski signature. The subgroups of the Lorentz group, SO(l, 1) and 
SO (5, 1), respectively, are identified, after the signature change, with part of the i?-symmetry group of 
the four- dimensional theory, namely SO (2) and SO (6), respectively. The other difference is the reality 
condition on the spinors: compare (10.16) or (10.17) with the situation in the four-dimensional context 
where the spinors \ A and A^ are independently real. 

The equations-of-motion which follow form (10.18) are 

V rn F mn + V a F an = 2a n {\ A , \ A } , (10.19a) 

f\ A = -%Z aAB V a \ B , (10.19b) 

Tp\ A = -iY, AB V a \ B , (10.19c) 

V n F na + V b F ah = iY, aAB \ A \ B + iT, AB X A X B . (10.19d) 
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X.2.2 The moduli space approximation 

The case of pure gauge theory the effective moduli space dynamics involved an expansion in £ a deriva- 
tives. When there are fermion fields are involved, we also allow the associated Grassmann collective 
coordinates to depend on £ a . The moduli space approximation is then an expansion in n = rig + \nf. 
the number of £ a derivative plus half the number of Grassmann collective coordinates. The lowest 
non-trivial term in the effective action are those terms of order n = 2 and in order to derive them we 
must solve the equations-of- motions up to order n = l. 73 

The equations to order n = are solved, as in §X.l, by embedding the instanton solution as 
A N = (0, A n (x; X (£))). At the next order, n — \, the fermions \ A satisfy the covariant Weyl equation 
(4.24a) with the solution \ A = A(M A ), where M A are the Grassmann collective coordinates. In order 
to extend the notion of the moduli space approximation to the supersymmetric theories we should also 
allow the Grassmann collective coordinates M. A to depend on £ a . 

At order n — 1, we have the following equation (generalizing (10.4))): 

V n {g^d a X>* - V n Aa) = -i^ aAB \ A \ B ■ (10.20) 

As in the case of pure gauge theory, we can use the background gauge condition on the zero modes of 
A n to solve these equations for A a . The new ingredient is the presence of the source bi-linear in fermion 
zero modes on the right-hand side. However, as the equation is linear in A a we can take the linear 
combination 

A a (x; X(0,M A (0) = n M (x; X(0)d a X^) + uf> a (x; X(£),M A (Z)) , (10.21) 
where the Hermitian field (f> a satisfies 

V 2 <j> a = Z aA B\ A \ B ■ (10.22) 

Fortunately, we have already solved the covariant Laplace equation in the ADHM background a bi- 
fermion source (see §IV.3 and Appendix C Eq. (C.25)). 

X.2.3 The effective action 

We now substitute our solution into the action of the theory and extract the terms of order n = 2. At 
this order, the terms which contribute to the effective action are 

5 (2) = J_ f d D ytTN f FnaF na _ lX A ^ AB V a \ B } ■ (10.23) 

We now evaluate this expression. First of all, we have 

F na = V n A a - d a A n = iV n <p a - S^AndaX" (10.24) 

73 Note that the possible cross-terms between fields of lower order, n = and n = 1/2, and fields of higher order, 
n = 3/2 and n = 2, which potentially could contribute to the effective action at orders < 2, actually vanish by the 
equations-of- motion. 
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and, writing X B = A(M B ), 

X A fl AB V a X B = \« A ^ AB [K{d a M B ) + ?^d a X» - K a (?±±-)d a X» + [A a , X B ] j 

= \ aA Z AB {A a (d a M B ) + (M + [n^Af] - A Q (^))^ + # a ,Af]} 
Consequently, using the gauge condition (2.24), the action (10.23) becomes 

5(2) = 1. f d D ytTN L,A n d a X^ u A n d a X" - iX A Z a AB A(d a M B ) 
9d J I 

/ d\ B (9 KA B \ 

- *x A z AB + [n„ \ B ] - A (^xjr))daX» - v n rv n <p a + x A ^ AB [<p a , x 1 



;i0.25) 



(10.26) 



The first two terms can be evaluated using the inner-product formulae (2.32) and (4.37). The third 
term can be evaluated using the identity (C.51) established in Appendix C: 

^ + K ,X*}=tel + A(^L), (1027) 

where 

r)n a 

f A = \U^rJM A U . (10.28) 
Therefore the third term in (10.26) involves the integral 



d A xti N X A TpQ B . (10.29) 
We can now use the fact that f>X A = to write this as 

d 4 xd a& (tr N X aA g AB ) . (10.30) 



Using the asymptotic formulae in §11.4.3, one sees that the surface term at infinity vanishes and so 
there is no contribution from the third term in (10.26). The final two terms (10.26) can be evaluated 
following an identical analysis to the construction of the instanton effective action in §V.2. Notice that 
this term vanishes in the p — 1 case. 74 

Putting everything together we have 

5 (2) = jL f tf+^trJ - \g^{X)d a X»d a X" - 2m 2 f, a AB Jl A d at x B - m 2 Y? AB M! A d a M! B 

9dJ 1 (10.31) 

+ We ABCD M A M B L- l M c M r 



This expression is somewhat schizophrenic because the bosonic fields X M are the intrinsic coordinates 
while the fermionic fields Ai A are the ADHM variables, subject to the fermionic ADHM constraints. 
In order to unify things we can procedure in two alternative ways, either writing everything in terms 



74 Note the fact that we arc in Minkowski space does not affect the result since {^AB^aCD) Mink = (^aAB^aCo) Eucl 
Zcabcd- 
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of intrinsic variables to or in terms of the ADHM variables with the ADHM constraints explicitly 
imposed. Both viewpoints are worth developing. 

First the intrinsic expression. In §IV.2.2, we introduced ip , % — 1, . . . , 2kN, the intrinsic Grassmann- 
valued symplectic tangent vectors to 9JTfc. In (10.31), the terms quadratic in the fermions are written 
in terms of the intrinsic objects ip A as 

in 2 Z a AB (2ji A d a ii B + M' A d a M' B ) = 4Z AB n(M(i> A ,X),M(d a i> B ,X) + -^-d a X») (10.32) 

using the linearity of A4 A on %jj A . The first term can be written in terms of the intrinsic symplectic 
matrix Q{j(X) defined in (4.39). The relevant geometrical quantity corresponding to the second term 
is the symplectic spin connection on 971^. In Appendix B we explain in the context of the hyper-Kahler 
quotient construction how the connection V on the quotient space is inherited from the connection V on 
the mother space 9Jt by orthogonal projection to "K. In a completely analogous way the spin connection 
for symplectic tangent vectors on DJlk is obtained from that of 971 by projection via the fermionic ADHM 
constraints. So since M A are subject to the fermionic ADHM constraints 

Q(M A , V X M B ) = Q(M A , V X M B ) . (10.33) 

Then since Wl is flat, we can identify the second term in (10.32) with 

A^a^X^uj^daX^ , (10.34) 

where u^j are the elements of the symplectic spin connection. 

The final term, as we have explained in §V.2, involves a coupling to the symplectic curvature of 971^. 
Writing all the terms using in the intrinsic variables X^ and tp lA , we have 

s(2) = -7^r I d p+1 dg,u(x)d a x^d a x^ + ^ aAB n ij (x)^ A (dj\ + ^ 

2 9dJ 1 (10.35) 

In both cases p — 1,5, this is the action of a non-linear a- model with 971^ as target with, as we shall show 
in §X.2.4, M = (0,4) or M = (0,1) supersymmetry, respectively. For references on two-dimensional 
Af = (0,4) theories see [148,149], while for six-dimensional Af = (0,1) theories see [153,154] and 
references therein. 



We now follow the opposite philosophy and attempt to write the effective action (10.31) entirely in 
terms of the ADHM variables a„ and M A . In this form, the bosonic, as well as the fermionic, ADHM 
constraints are now implicit. We claim that the correct expression is 

S(2) = 4 / d p+1 £tr k { - 4d a w«d a w« - Ad a a' n d a a' n - 2ii: AB fi A d a fi B - ii: AB M' aA d a M' B 

3d J 1 (10.36) 

+ \ [^ AB M A M B + 2t(a«d a a„ - d a a«a & )] L 1 [Z aCD M c M D + 2t(a^d a a^ - d a a^a p )] } . 

We must show that this reduces to (10.31) when we substitute = da(X). Firstly, we have to recall 
the expression (2.117) for the metric on the quotient space in terms of the ADHM variables a^iX). This 
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accounts for the first two terms in (10.36). Most of the fermionic terms in (10.36) are already present in 
(10.31) in particular the fermion quadrilinear term is equal to that in (10.31) by the E-matrix identity 
(A. 20). We seem to have the additional terms dependent on a a d a aa — d a a a ad, however this vanishes 
when we substitute aa(X^) since 

a a d a a A - d a a a a d = (a Q ^ - — a*)^" = , (10.37) 
by virtue of the constraint (2.114). 

The form (10.36) is motivated by the fact that one can introduce a non-dynamical U(k) (Hermitian) 
gauge field Xa coupled to the fields via the covariant derivatives 

V a w & = d a w & - iWaXa , V = d a w a + %XaW a , V a a' n = d a a' n + i[xa, a' n ] , 
V af i A = d af i A - tfi A Xa , V a fi A = d a fi A + t Xa fi A , V a M' A = d a M' A + t[ X a,M' A ] . 
The action (10.36) can then be written in the form of a gauge linear a-model 

A 2 P 

S (2) = _ J_ / dP+ i^ tl J V a^ VaW . + V a a' n V a o! n + I i: a ABl 2 A V a fi B + lZ a AB M' A V a M' B } . (10.39) 

Remember that the bosonic and fermionic ADHM constraints are imposed implicitly. This can be 
achieved directly by introducing, as in §VI.5, bosonic and fermionic Lagrange multipliers in the form 

— * — . 

of k x k Hermitian matrix fields, D and ip% respectively: 

47T 2 ? f r - 

SU. = — / d p+1 £tr k \ D ■ T^pP a & + n(M A a„ + a«M A ) } . (10.40) 
9d J L J 

The non-dynamical gauge field, along with the Lagrange multipliers then form a vector multiplet 
{Xa, D, ^a} of the supersymmetry appropriate to the p — 1 and p = 5 cases. 



X.2.4 Supersymmetry 

Just as the supersymmetry of the parent theory is inherited by the collective coordinates of instanton, 
we expect the same to be true for the instanton branes. The only difference now is that the collective 
coordinates are now fields and so the supersymmetry transformations will involve £ a derivatives. 

From (10.10a)-(10.10b), one finds 

5A m = ii A a m \ A + iUa m \ A , (10.41a) 
sx a = iTi abA B ^B Fab + lGmni A Fmn + E aAB aU B F am . (10.41b) 

These transformations are closely related to (4.23a) and (4.23b), respectively. (Indeed, by removing £ a 
derivatives and replacing A a by i<p a they are identical.) Now we let A rn and X A take their ADHM values 
with ^"-dependent collective coordinates and A a as in (10.21). The variations lift to variations of the 
collective coordinates in an almost identical way to those derived in §IV.5. The only difference, is an 
extra contribution from the final term in the variation of \ A . Using (10.21), this term is 

-VL aAB t^alB + ^ aAB {d a A n - %d a X»)a n l A . (10.42) 
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The first term, here, is identical to the final term in (4.23b) and so is already accounted by the analysis 
in §IV.5. It is the second term which is new. Since 

8A 

d a A n - il^X" = ^d a X» - n^d a X» = b^A n (10.43) 

we can use the explicit expression for the zero modes (2.113) to write the final term as 

2Z aAB ^A^)d a X» . (10.44) 

So there is an extra ^"-derivative term, relative to (4.70), in the transformation of the Grassmann 
collective coordinates: 

5M A = -Aiitb a + 2iH aAB C a ^% + TL aAB d a a h l% , 
5M A = -AiC A K + 2iY, aAB ^ B C: + 2Z aAB d a a«U B . 

These transformations are symmetries of the effective action (10.31). In counting the number of super- 
symmetries, we do not include the trivial shifts of the fermions generated by ^ A and this means that the 
effective theory of the instanton branes has half the number of supersymmetries of the parent theory: 
so 8 for the six-dimensional (p = 5) theory and 4 for the two-dimensional (p = 1) theory. In both cases 
the supersymmetries are all anti-chiral the hence, using the usual convention, are denoted as Af = (0, 1) 
and M — (0, 4), for p — 5 and p — 1, respectively. 



X.2.5 Relation to the instanton calculus 



The instanton calculus for Af = 2 and Af = 4 supersymmetry can be obtained as a particular limit of 
the a-model described above. This limit involves performing a Wick rotation of the p + 1-dimensional 
world- volume of the brane and then a dimensional reduction to zero dimensions. In this limit, the 
field theory reduces to a matrix theory and the only part of the "dynamics" that remains, in the case 
with p = 5 (8 supercharges) is the four-fermion coupling in (10.35). This four-fermion coupling then 
reproduces the quadrilinear coupling of the Grassmann collective coordinates described in §V.2. On 
top of this, the collective coordinate integral of the instanton calculus is obtained directly from the 
Wick rotated partition function of the dimensionally reduced a-model. This is most easily seen in the 
gauged linear a-model description of Eq. (10.39). On dimensional reduction the partition function of 
the a simply gives the instanton partition in the linearized version constructed in §VI.5. One can see 
that the auxiliary variables {x a , D, i/ja} introduced in §VI.5 arise from the dimensional reduction of the 
fields of the vector multiplet of the a-model. 

It is interesting to ask whether the constrained instanton formalism describing the Coulomb branch 
of the Af = 2 and Af = 4 theories can be re-produced in this way. To answer this question, we use 
the fact that for our six-dimensional a-model with 8 supercharges, relevant to the Af = 4 case, it 
is possible when dimensionally reducing to add a potential which do not break supersymmetry [150]. 
This potential term has the form of the inner product of a tri-holomorphic Killing vector field on the 
hyper-Kahler target space. In fact the most general type of potential for an 8 supercharge a-model in 
6 — /-dimensions involves the sum of the inner-products of I commuting tri-holomorphic Killing vector 
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fields [143,151]. At the level of the non-linear cr-model the introduction of this potential can be described 
by non-trivial dimensional reduction of the Scherk-Schwarz kind [152]. At the level of the linear cr-model 
we can describe it as gauging an additional U(l)' symmetry, over and above U(A;), corresponding to the 
action of the tri-holomorphic Killing vector fields, and then giving VEVs to the / components of the 
six-dimensional gauge field Xa that lie in the dimensionally reduced directions. 

In the present setting of the instanton calculus, DJlk admits a SU(iV) tri-holomorphic action cor- 
responding to global gauge transformation in the original SU(iV) gauge theory. The action of these 
transformations on the bosonic ADHM variables was described in §11.4.3. Under global gauge transfor- 
mations a' n and M.' A are invariant, but 

Wa -> UWa , // -> Ufl A . (10.46) 

Commuting actions can be obtained by considering global gauge transformations in the Cartan subgroup 
U(l) iV - 1 c SU(iV) generated by diagonal matrices. We now gauge this symmetry and reduce to zero 
dimensions. This amounts to replacing the covariant derivatives in (10.39) by 

T> a w & -> -iw & Xa ~ i4>° a w & , -> ix a w a + iw a (f) a , V a a' n -> -i[a' n , Xa] , 

(10 47) 

VafI A ^ _ ifi A Xa _ ^ Vafi A ^ iXa -A + -^0 ^ VaM ,A ^ _ i[M ^ 

where 0° are six diagonal N x N matrices. It is easy to see that after integrating out the gauge field Xa 
one is left with the instanton effective action describing constrained instantons on the Coulomb branch 
of M = 4 gauge theory (6.94). 

In a similar way, the constrained instanton calculus of the M = 2 theory can be obtained by gauging 
the same U(l) Ar_1 symmetry and dimensionally reducing to zero dimensions the two-dimensional a- 
model with M = (0,4) supersymmetry. In this case, there is another kind of generalization which plays 
a role in the instanton calculus. Since the M = (0,4) supersymmetry is purely chiral it is possible 
to fermions of the opposite chirality which are singlet under the supersymmetry. This is precisely 
what is required to describe the instanton calculus when the J\f = 2 gauge theory involves additional 
matter hypermultiplets. As described in §VI.3.2, instantons have additional fermion zero modes and 
there are new Grassmann collective coordinates {/C/,AC/}. Consequently the cr-model describing the 
collective coordinate dynamics will have additional fermionic fields describing the fluctuations of the 
hypermultiplet Grassmann collective coordinates. These are incorporated in the two-dimensional cr- 
model dynamics in precisely the same way as in description of monopole dynamics in an M = 2 theory 
with hypermultiplets described in [144] (although in this reference the a-model is in one dimension, 
i.e., is a quantum mechanical system). 

X.3 Instantons and string theory 

The most remarkable and unexpected developments of the instanton calculus has come with the real- 
ization that the ADHM formalism arises naturally in the context of string theory. The point is that 
supersymmetric instanton branes, as previously described in §X.2, arise when D-branes are "absorbed", 



165 



in a way to be made precise, on other D-branes. In certain respects, the string theory point-of-view 
provides an "explanation" for the rather mysterious ADHM construction in the sense that the ADHM 
variables, constraints and the internal U(fc) symmetry have a simple interpretation in terms of a con- 
ventional (but auxiliary) gauge theory with gauge group U(k) (see Refs. [155-157]). Moreover, the 
connection is very explicit: not only can the ADHM gauge potential be derived [158] but the cx-model 
describing the dynamics of instanton branes is obtained in a direct way [21]. Both are obtained in a 
certain decoupling limit where stringy effects can be neglected. In this way one "derives" the leading- 
order semi-classical expression for the collective coordinate integral of the original gauge theory. Some 
aspects of the relation between Yang-Mills and D-instantons, particularly for k — 1, are also discussed 
in Refs. [159,160]. 

X.3.1 The Af = 4 instanton calculus 

We begin by describing how the Af = 4 instanton calculus can be recovered from string theory. The 
basic idea is that the Af = 4 theory with gauge group U(iV) arises as the collective dynamics of N 
D3-branes in Type IIB string theory and, according to [156, 156], D-instantons located on the D3- 
branes are equivalent to Yang-Mills instantons in the collective coordinate world-volume gauge theory. 
More generally, Dp-branes located on a stack of D(p + 4)-branes appear as instanton p-branes in the 
world-volume theory of the higher-dimensional branes. So our attention will focus a general system of 
k Dp-branes and N Y){p + 4)-branes. 

We begin by briefly reviewing some basic facts about D-branes in Type II string theory [161]. There 
are two distinct ways to think of a Dp-brane. Firstly, it can be viewed as a brane-like soliton of the Type 
II supergravity in ten dimensions, the low energy limit of the string theory. In this respect, we expect 
that in an appropriate limit its behaviour should be captured by a generalization of Manton's moduli 
space dynamics: in this case some (p + l)-dimensional field theory on the world volume. Secondly, a Dp- 
brane can be described as a p-dimensional hyper-plane on which open strings can end. The connection 
between these two points-of-view is that the massless states in the open string spectrum in the presence 
of the Dp-brane give rise to massless fields which propagate on the p + 1-dimensional world volume of 
the D-brane and these massless modes are identified with the collective coordinates of the moduli space 
approximation. 

Specifically, the massless modes or collective coordinates of a single Dp-brane come from a simple 
dimensional reduction to p + 1 dimensions of a U(l) vector multiplet of Af = 1 supersymmetry in 
ten dimensions. This supersymmetric theory, which has 16 supercharges, consequently describes the 
moduli space dynamics of the D-brane. The action is (10.9) with D = 10 dimensionally reduced to 
p + 1 dimensions. After dimensional reduction, the ten-dimensional gauge field An, N = 0, 1,2, . . .9, 
yields a p + 1 dimensional gauge field A n , n = 0, . . . ,p and 9 — p scalars <p a , a = 1, ... ,9 — p. In 
the collective coordinate interpretation, the scalars specify the location of the D-brane in the 9 — p 
dimensions transverse to its world volume. The d — 10 multiplet also includes a Majorana-Weyl 
fermion The sixteen independent components of \I/ correspond to the sixteen fermion zero modes of 
the D-brane. This number reflects the fact that the D-brane is a BPS configuration which breaks half 
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of the 32 supersymmetries of the Type II theory. 

Remarkably, the collective dynamics of a system of N parallel Dp-branes is obtained by simply 
"non-abelianizing" the U(l) gauge group thereby replacing it with U(iV). At low energies, the adjoint- 
valued scalar fields can acquire VEVs breaking the gauge group to XJ(1) N describing a configuration 
of Dp-branes separated in the transverse directions. However, when two or more D-branes coincide, 
additional states corresponding to open strings stretched between the two branes become massless 
leading to enhanced gauge symmetry [163]. In the maximal case, where all N Dj9-branes coincide, the 
unbroken gauge group is U(iV). The low-energy effective action for the world- volume theory can be 
obtained from dimensional reduction of ten-dimensional super- Yang-Mills theory (10.9) (with D = 10) 
with gauge group U(iV). Dimensional reduction to p + 1 dimensions proceeds by setting all spacetime 
derivatives in the reduced directions to zero. As in the case of a single Dp-brane, the ten-dimensional 
gauge field yields a p + 1-dimensional, but now U(iV)-valued, gauge field and 9 — p real adjoint scalars. 
Configurations with some or all of the D-branes separated in spacetime corresponds to the Coulomb 
branch of the world- volume gauge theory. In terms of string theory parameters, the Yang-Mills coupling 
constant in p + 1 dimensions is identified as 

g 2 p+1 = 2(27rr W (P " 3)/2 , (10.48) 
where g st is the string coupling constant, a' is the inverse string tension. 

In the case of N parallel D3-branes, the low-energy effective theory on the brane world-volume 
is four-dimensional M = 4 supersymmetric Yang-Mills theory with gauge group U(iV) and coupling 
constant g 2 = An g st . In this case we can also introduce a world- volume vacuum ang le = 27rC(°) where 
is the VEV of the Ramond-Ramond scalar of the IIB theory. In general, the four-dimensional fields 
which propagate on the brane also have couplings to the ten-dimensional graviton and the other bulk 
closed string modes. To decouple the four-dimensional theory from these bulk modes it is necessary to 
take the limit a' — > with with g (i.e. g st ) held fixed and small. This gives a weakly coupled gauge 
theory on the D3-brane. 

A D-instanton (or D(— l)-brane) corresponds to the extreme case where the dimensional reduction 
is complete and the world-volume is a single point in Euclidean spacetime. 75 Correspondingly, rather 
than having finite mass or tension, a single D-instanton has finite action 

2tt(#- 1 - iC i0) ) = -2-Kir , (10.49) 

using the relations (10.48). Hence, a D-instanton carries the same action as a Yang-Mills instanton in the 
world volume gauge theory of N D3-branes described above. From our discussion above, the collective 
coordinates of a charge-A; D-instanton correspond to a U(k) vector multiplet of ten-dimensional Af = 1 
supersymmetric gauge theory dimensionally reduced to zero spacetime dimensions. As we dimensionally 
reduce to zero dimensions, the fields become both c-number and Grassmann matrix degrees-of-freedom. 
In addition to a constant part equal to —2irikr, the action of a charge-/c D-instanton also depends on 
the collective coordinates via the (Wick rotated) dimensional reduction of (10.9) (with D — 10): 

S = -itr fc [A M , A n ] 2 + \ti k ^T M [A M , *] . (10.50) 
75 So before dimensional reduction, we must Wick rotate the D = 10 action (10.9) to Euclidean space. 
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In addition to the manifest SO(10) symmetry under ten-dimensional rotations, the action is trivially 
invariant under translations of the form Am — > Am + ^mI^xm- Hence k~Hrk Am, which corresponds 
to the abelian factor of the U(fc) gauge group, is identified with the position of the centre of the charge 
k D-instanton in M 10 . 

The action (10.50) inherits the supersymmetries (10.10a)-(10.10b) along with linear shifts in the 
Grassmann collective coordinates: 



The sixteen components of the Majorana-Weyl SO(10) spinor e correspond to the sixteen zero modes 
of the D-instanton configuration generated by the action of the D = 10 supercharges. Like the bosonic 
translation modes, these modes live in the abelian factor of the corresponding U(fc) field, k~ 1 tik VP. In 
contrast, the Majorana-Weyl spinor S parameterizes the sixteen supersymmetries left unbroken by the 
D-instanton. 

In ordinary field theory, as we have seen, instantons yield non-perturbative corrections to correlation 
functions via their saddle-point contribution to the Euclidean path integral. In the semi-classical limit, 
the path-integral in each topological sector reduces to an ordinary integral over the instanton moduli 
space (5.14). The extent to which similar ideas apply to D-instantons is less clear, in part because 
string theory lacks a second-quantized formalism analogous to the path integral. Despite this, there is 
considerable evidence that D-instanton contributions to string theory amplitudes also reduce to integrals 
over collective coordinates at weak string coupling [133]. In this case the relevant collective coordinates 
are the components the ten-dimensional U(fc) gauge field Am and their superpartners \& which in the 
dimensional reduction end up as matrices. According to Green and Gutperle [133,138,164,165], the 
charge-/c D-instanton contributions to the low-energy correlators of the IIB theory are consequently 
governed by the partition function 



This partition function can be thought of as the collective coordinate integration measure for k D- 
instantons. In particular, the leading-order semi-classical contribution of k D-instantons to the correla- 
tors of the low-energy supergravity fields is obtained by inserting into (10.52) the classical value of each 
field. The collective coordinate action (10.50) does not depend on the U(l) components of the fields Am 
and Hence, to obtain a non-zero answer, the inserted fields must, at the very least, involve a product 
of at least sixteen fermions to saturate the corresponding Grassmann integrations. As in field theory 
instanton calculations, the resulting amplitudes can be interpreted in terms of an instanton-induced 
vertex in the low-energy effective action. The spacetime position of the D-instanton, X M = k~Hr k A N , 
is interpreted as the location of the vertex. In particular, the work of Green and Gutperle [133] has 
focused on a term of the form i? 4 in the IIB effective action (here R is the ten-dimensional curvature 
tensor) and its supersymmetric completion. 

So far we have only considered D-instantons in the IIB theory in a flat ten-dimensional background 
and in the absence of other branes. In order to make contact with four-dimensional gauge theory we need 
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to understand how these ideas apply to D-instantons in the presence of D3-branes. In particular we wish 
to determine how the D-instanton collective coordinate integral (10.52) is modified by the introduction 
of N parallel D3-branes. Conversely, in the absence of D-instantons, the theory on the four-dimensional 
world volume of the D3-branes is M = 4 Yang-Mills with gauge group U(iV). Hence a related question 
is how the D-instantons appear from the point-of-view of the four-dimensional world- volume theory of 
the D3 branes. In fact, the brane configuration considered here is a special case of a system which has 
been studied intensively involving k Dp-branes in the presence of N D(p + 4)-branes, with all branes 
parallel. As we will review below, the lower-dimensional branes corresponds to Yang-Mills instantons 
in the world- volume gauge theory of the higher-dimensional branes [156]. We begin by reviewing the 
maximal case p — 5, which was first considered (in the context of Type I string theory) by Witten [155]. 
The cases with p < 5 then follow by straightforward dimensional reduction. 

We start by considering a theory of k parallel D5-branes (in Type IIB string theory) in isolation. As 
above, the world-volume theory is obtained by dimensional reduction of ten-dimensional M = 1 Yang- 
Mills theory with gauge group XJ(k). The resulting theory in six dimensions has two Weyl supercharges 
of opposite chirality and, therefore, conventionally has M = (1, 1) supersymmetry. 76 After dimensional 
reduction, the SO(10) Lorentz group of the Minkowski theory in ten dimensions is broken to H = 
SO(5, 1) x SO(4). The SO(5, 1) factor is the Lorentz group of the six-dimensional theory while the 
SO (4) is an i?-symmetry. The ten-dimensional gauge field Am splits up into an adjoint scalar in the 
vector representation of SO (4) and a six dimensional gauge field. Explicitly we set 

A M = i(xa,^ra' n ^ , a = l,...,6; n = l,...,4. (10.53) 

Since, in our conventions, A M is anti-Hermitian, both a' n and Xa are Hermitian. The factor of (2na / )^ 1 
has been inserted so that a' n will subsequently be identified with the quantity of the same name in the 
instanton calculus. 

In order to describe the fermion content of the theory consider the covering group of H, H — 
SU(4) x SU(2)x, x SU(2)#. We introduce spinor indices A = 1,2,3,4 and a, a — 1,2 corresponding to 
each factor. As mentioned above, a ten-dimensional Majorana-Weyl spinor is decomposed under H as 

16 -(4, 2,1) ©(4, 1,2) , (10.54) 

so contains two Weyl spinors of opposite chirality in six dimensions. With the representation of the 
ten-dimensional Clifford algebra as in (10.11) a Majorana-Weyl fermion, as in (10.13), can be written 

*= sM9 8 0*'« + © (?)**• (ia55) 



Altogether, the fields (x Q , a' n , M'^, 4>a) form a vector multiplet of M = (1, 1) supersymmetry in six 
dimensions. In terms of an M = (0, 1) subalgebra, the M = (1, 1) vector multiplet splits up into an 
J\f = (0, 1) vector multiplet containing {Xaj^A.} an d an adjoint hypermultiplet containing {a' n , M!£}- 
The action of the Af = (1, 1) theory is deduced from (10.18) (with g D — > g 6 ): 

S = — g f'S'gaugc + ~j 2 /2 ^matter) = ~ a '-'gauge + ^matter) > (10.56) 
9q v 47r a 7 9p+5 V 7 

76 Some convenient facts about six-dimensional supersymmetry are reviewed in [166]. (See page 67 in particular.) 
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(in the present case p — 5) where 

Sgaugc = J d^tv k [\F 2 ab - iH aAB i> A V a i> B + \D mn ] (10.57) 

and 

S£l tter = J d^tT k {v a a' n V a a' n - {E AB M A V a M' B - t[M' aA , a'^A + & • ^Xd} • (10.58) 

Here, the covariant derivatives are for adjoint- valued fields: T> a a' n = d a a' n + i[Xa, a ' n ], etc. For later 
convenience we have introduced a real anti-self dual auxiliary field for the vector multiplet, D mn = 
—*D mn ), transforming in the adjoint of SU(2) R . Since D mn is anti-self-dual we can write 

D mn = tr 2 (ra mn ) • D , (10.59) 

which defines the 3-vector D. We have also used the relation g 2 +5 = {2ix) A a' 2 g 2 +1 . 

Following [156], the next step is to introduce iV D9-branes of the Type IIB theory whose world- 
volume fills the ten-dimensional spacetime. 77 These will give rise on dimensional reduction to the N 
D3-branes in the p = — 1 case on which we will eventually focus. The world volume theory of the 
D9-branes in isolation (i.e. in the absence of the D5-branes) is simply ten-dimensional U(iV) Af = 1 
supersymmetric gauge theory. As explained by Douglas [156], the effective action for this system 
contains a coupling between the 2-form field strength F of the world-volume gauge field and a 6-form 
field which comes from the Ramond-Ramond sector of the Type IIB theory. This coupling has the 
form, 

J C (6) AFAF, (10.60) 

where the integration is over the ten-dimensional world-volume of the D9-branes. The same six-form 
field also couples minimally to the Ramond-Ramond charge carried by D5-branes. Hence a con- 
figuration of the U(iV) gauge fields with non-zero second Chern class, F A F, acts as a source for 
D5-brane charge. More concretely, if the D9-brane gauge field is chosen to be independent of six of 
the world-volume dimensions and an ordinary Yang-Mills instanton is embedded in the remaining four 
dimensions, then the resulting configuration has the same charge density as a single D5-brane. Both 
objects are also BPS saturated and therefore they also have the same tension. Further confirmation of 
the identification of D5-branes on a D9-brane as instantons was found in [156] where the gauge-field 
background due to a Type I D5-brane was shown to be self-dual via its coupling to the world-volume 
of a Dl-brane probe. 

As described above, D5-branes appear as BPS saturated instanton configurations on the D9-brane 
which break half of the supersymmetries of the world-volume theory. Conversely, the presence of D9- 
branes also breaks half the supersymmetries of the D5-brane world-volume theory described by the 
action (10.56). Specifically, the Af = (1,1) supersymmetry of the six-dimensional theory is broken 
down to the Af = (0, 1) subalgebra described above equation (10.56). To explain how this happens we 
recall that open strings stretched between branes give rise to fields which propagate on the D-brane 

77 In fact a IIB background with non- vanishing D9-brane charge suffers from inconsistencies at the quantum level. This 
is not relevant here because the D9-branes in question are just a starting point for a classical dimensional reduction. 
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world-volume. So far we have only included the adjoint representation fields which arise from strings 
stretching between pairs of D5-branes. As our configuration now includes both D5- and D9-branes there 
is the additional possibility of states corresponding to strings with one end on each of the two different 
types of brane. As the D5-brane and D9-brane ends of the string carry U(k) and U(iV) Chan-Paton 
indices respectively, the resulting states transform in the (k, N) representation of U(A;) x U(iV). 

In fact the additional degrees-of-freedom transform as hypermultiplets of Af = (0, 1) supersymmetry 
in six dimensions [155]. As these hypermultiplets cannot be combined to form multiplets of Af = (1, 1) 
supersymmetry, the residual supersymmetry of the six dimensional theory is A/" = (0, 1) as claimed 
above. Each hypermultiplet contains two complex scalars w ui a, a — 1,2. Here, as previously, % and u are 
fundamental representation indices of U(/c) and U(iV) respectively. The fact that hypermultiplet scalars 
transform as doublets of the SU(2) i?-symmetry is familiar from Af = 2 theories in four dimensions. 
Each hypermultiplet also contains a pair of complex Weyl spinors, /i^ and Jlf u . The six-dimensional 
action for the hypermultiplets can be deduced from the action (10.58) of the {a' n , A4 A } hypermultiplet: 

^^mitter = [ ^tr fe { - VVV a Wa - ^^V^ - l^W* + W &f l, A )$% + iD ■ T^^wA . 

(10.61) 

The scalar and fermion kinetic terms in the above action include the fundamental representation co- 
variant derivative, T> a w = d a w — iwx a , etc. The remaining two terms in (10.61) are the fundamental 
representation versions of the Yukawa coupling and D-terms which appear in (10.58). The complete 
action of the six-dimensional theory is then the amalgam of (10.57), (10.58) and (10.61): 

S = ifl (4tt V 2 S gaugc + it tcr + S£ tter ) . (10.62) 

9p+5 v 7 

The various fields of the six dimensional theory and their transformation properties under U(fc) x U(iV) x 
H are collected in Table 10.1. 
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Table 10.1: Transformation properties of the fields 
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The J\f = (0, 1) supersymmetry transformations for the theory can be deduced from the super- 
symmetry transformations of ten-dimensional Yang-Mills theory in Eqs. (10.10a) and (10.10b). The 
J\f = (0,1) supersymmetry of the six-dimensional action (10.57), (10.58) and (10.61) is then obtained 
as the subalgebra of this ten-dimensional N = 1 supersymmetry by taking 




(10.63) 



This yields the transformations 

= %«aM'£ , 5M' A = 2Z aAB ^V a a' a& , (10.64a) 
5 Xa = i^Ui'B , #a = Y> ab A B F ab i B - ia mn D mn U . (10.64b) 

The transformations act on the fundamental hypermultiplets {iVa, fi A } in an analogous way to the action 
on the adjoint hypermultiplet {a' n , Ai' A }: 

Swa = i^ A ii A , 5fi A = TL aAB C B V aWdi . (10.65) 



The reader will have noticed that we have chosen our notation so that each hypermultiplet field has 
a counterpart, denoted by the same letter, in the M = 4 instanton calculus. In particular the various 
indices on these fields correspond with those on the corresponding ADHM variable. The physical reason 
for this correspondence is simple: the six-dimensional fields are the collective coordinates of the D5- 
branes. As the D5-branes are equivalent to Yang-Mills instantons, the vacuum moduli space of the U(fc) 
gauge theory on the D5-branes should coincide with the Af = 4 supersymmetric fc-instanton moduli 
space described by the ADHM construction. As the only scalar fields in the six-dimensional theory lie in 
M = (0, 1) hypermultiplets, the relevant vacuum moduli space is conventionally referred to as a Higgs 
branch. Precisely how the proposed equivalence arises was explained in [155]. The Higgs branch is 
described by the vanishing of the D-term. The D-term equation-of-motion which follows from varying 
the action is 

®' 2 D = -^- 2 r% (w*w & + a'^a' aa ) . (10.66) 

— # 

So the condition for a supersymmetric vacuum, D = 0, is equivalent to the ADHM constraints (2.65). 
Since global U(k) gauge transformations relate equivalent vacua, we see that the Higgs branch of theory 
yields precisely the ADHM construction of the instanton moduli space DJlk in the form of a hyper- 
Kahler quotient. This is a particular example of the more general fact that the Higgs branch of a 
gauge theory with 8 supercharges is a hyper-Kahler quotient (for example see Refs. [147,167,168]). The 
general construction involves a gauge group G and hypermultiplets transforming in some representation 
of G. The canonical kinetic terms of the hypermultiplet scalars z ia defines the metric on the mother 
space (B.35) and the gauge group action defines the vector fields X r in (B.37). The auxiliary fields D 

— * 

are the moment maps (B.39). Setting D = and dividing by global gauge transformations gives the 
hyper-Kahler quotient construction. 78 

At a generic point on the Higgs branch, the U(fc) gauge symmetry is completely broken and at low 
energies the vector multiplet can be integrated out leading to a six-dimensional M = (0, 1) supersym- 
metric ex-model with 971^ as target space. At leading order, the low-energy effective action is obtained 
78 Note the ( c in (B.39) are precisely Fayet-Illiopolos terms for any abelian factors of G. 
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by simply removing S gauge , the kinetic terms of the vector multiplet, from the action (10.62). In this 

— * — 

limit, the vector multiplet fields {x a , D,%Pa} become auxiliary and the resulting theory is precisely the 
linear a-model describing the moduli space dynamics of instanton branes in ten-dimensional M = 1 
gauge theory constructed in §X.2. Note in this low-energy limit, D and if) a become the Lagrange mul- 
tipliers (10.40) for the c-number and Grassmann ADHM constraints, (2.65) and (4.34), respectively, as 
. Moreover, Xa is the auxiliary U(k) gauge field in the linear action (10.39) (also, up to a re-scaling, the 
auxiliary variables that we used to bi-linearize the action in §X.2.3). At higher energies, or for small 
VEVs, where non-abelian subgroups are restored, the stringy corrections involving the kinetic terms for 
the vector multiplet become increasingly important and the simple moduli space picture breaks down. 

Starting from the configuration of D5- and D9-branes described above, the general case of parallel Dp 
and D(p + 4) -branes with p < 5 can be obtained by dimensional reduction on the brane world-volumes. 
In this case the vacuum structure of the gauge theory describing the brane configuration is richer. The 
reason is that on dimensional reduction 5 — p of the components of the gauge field Xa, a = p + 1, . . . ,5, 
become adjoint scalars and can develop VEVs. These scalars describe the freedom for the Dp-branes 
to move off into the 5 — p dimensions transverse to both the Dp and D(p + 4)-branes. In addition to 
the ADHM constraints (2.65), the classical equations for a supersymmetric vacuum now include 

[Xa, Xb\ = W aXa = K> Xa] = . (10.67) 

The classical vacuum moduli space consists of distinct branches. First of all, there is a Coulomb 
branch where Xa and a' n are diagonal with Wa = and on which the gauge group is broken to U(l) fc . 
This corresponds to situation where the Dp-branes are located (generically) in the bulk away from 
the D(p + 4)-branes. The diagonal elements of x& give the positions of the k Dp-brane transverse to 
the D(p + 4)-branes while the diagonal elements of a' n gives the positions of the Dp-branes along the 
world-volume of the D(p + 4)-branes. When a given element (xa)u goes to zero, the corresponding Dp- 
brane touches the world-volume of the D3-branes. In this case, it is clear from (10.67) that it is then 
possible for w ui a (for the given value of i) to become non-vanishing. The Dp-brane is then "absorbed" 
onto the D(p + 4)-branes. On this new branch of the vacuum moduli space the Dp-brane becomes a 
genuine Yang-Mills instanton with a non-zero scale size pf = ^wf u w U ia (no sum on i). Other kinds of 
"mixed" branches arise when more of the k Dp-branes are absorbed into the D(p + 4)-branes. The Higgs 
branch described the situation where all the Dp-branes have been absorbed into the D(p + 4)-branes 
and so Xa — 0. Notice that the vacuum moduli space for on the Higgs branch with p < 5 continues 
to be the instanton moduli space DJlk- In the reverse sense, when an instanton shrinks down to zero 
size, i.e. becomes an ideal instanton in the gauge theory on the world- volume of the D(p + 4) -branes, it 
becomes a Dp-brane which can move off into the bulk. These transitions move through the points where 
the different branches of the vacuum moduli space touch. As long as p > 1, so that the world- volume 
theory on the Dp-branes is more than two-dimensional, we expect the qualitative picture of distinct 
branches to be valid, at least semi-classically. However, the points at which the different branches touch 
correspond to situations where a non-abelian subgroup of the gauge group is restored. The theory 
here will be strongly coupled and so we expect the moduli space description to break down. These 
are regimes where we cannot ignore the kinetic terms of the vector multiplet and stringy corrections 
are expected to be important. The situations with p < 1 are rather different since there can be no 
symmetry breaking in this case. However, the picture of the low-energy dynamics being described by 
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cr-models on the appropriate moduli spaces is still expected to be valid. 



In particular the case of DO /D4-branes in the Type IIA theory has been studied extensively because 
of its application as a light-cone matrix model of the M = (2,0) theory in six dimensions [169]. In 
this case, the DO-branes correspond to solitons in the 4 + 1 dimensional gauge theory on the D4-brane 
world-volume. These solitons are just four-dimensional Yang-Mills instantons thought of as static finite- 
energy configurations in five dimensions. At weak coupling, we expect the dynamics of these solitons to 
be correctly described by supersymmetric quantum mechanics on the instanton moduli space. Hence, 
it is not surprising that precisely this quantum mechanical system is obtained in [169] by dimensionally 
reducing the six-dimensional J\f = (0, 1) theory described above down to a single time dimension. On 
adding another world-volume dimension, we obtain the D1/D5 system discussed by Witten in [170]. 
The Dl-branes are now strings in a six dimensional Yang-Mills theory and, in the decoupling limit, 
their world-sheet dynamics is described by a two-dimensional M = (4, 4) non-linear a-model with the 
ADHM moduli space as the target manifold. This a-model has kinetic terms for the coordinates on 
the target and their superpartners and as usual the supersymmetric completion of the action involves 
a four-fermion term which couples to the Riemann tensor of the target. If we reduce this action back 
to one dimension by discarding spatial derivatives we obtain the quantum mechanics of [169]. 

In order to arrive at a description of D-instantons in the presence of D3-branes we must dimensionally 
reduce the world-volume theory of the D5-branes all the way to zero dimensions. However, in order 
to recover the instanton calculus, we must first perform a Wick rotation in the world-volume of the 
D5-branes. Vector quantities, including the E-matrices, in Minkowski space v a = (v°,v), a = 0, ... ,5, 
become v a = (v,iv°), a = 1,...,6 and the Euclidean action is S Eucl = —iS Mmk . After dimensional 
reduction, the symmetry group U(fc) x U(iV) x H of the six-dimensional system now has a simple 
interpretation: U(fc) is the auxiliary symmetry of the ADHM construction, U(iV) is the gauge group 
of the D3-brane theory, SU(4) is the i?-symmetry group of the M = 4 supersymmetry algebra and 
SO(4) = SU(2) L x SU(2) fi is the four-dimensional Lorentz group. 

We will now write down the collective-coordinate integral which determines the leading semi-classical 
contribution of k D-instantons to correlation functions of the low-energy fields of the IIB theory in 
the presence of N D3-branes. From the above discussion, the appropriate generalization of (10.52), is 
obtained by Wick rotating and then dimensionally reducing the partition function of the six- dimensional 
theory. The resulting matrix theory has a partition function 79 

Z k = Ar [ d\ d 8 \ d 3 D d 4 a' d 8 M' d 2 w d 2 w d 4 fx d 4 ft e~ s , (10.68) 

Vol U{k) J 

where the action is deduced from (10.57), (10.58) and (10.61) reduced to zero dimensions: 

4tt 2 

(47r 2 a' 2 S G + S K + S D ) , (10.69) 
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79 In this section, we shall not keep a careful track of the overall normalization of the partition function which, of course, 
is important for calculating physical quantities. 
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where g = g± and 

S G = tr k {l[ Xa ,Xb} 2 -^ B MxaM-lDl in } , (10.70a) 

S K = tr fe { - iXa,a' n } 2 + XaW«W,iXa - \^aABM taA [Xa,M'*\ + ^aAB^ ^ Xa) , (10.70b) 

S D = tr fc { - W ■ f^V* + a Pa a a& ) + i{ji A w* + w^ A + [A4' A , a^])^} • (10.70c) 

Note that Sq arises from the dimensional reduction of .Sgauge and Sk + Sd comes from the dimensional 
reduction of fitter + ^matter- Specifically, Sk contains the six- dimensional gauge couplings of the 
hypermultiplets while Sd contains the Yukawa couplings and D-terms. 

Semi-classical correlation functions of the light fields can be calculated by replacing each field with 
its value in the D-instanton background and performing the collective coordinate integrations with 
measure (10.68). In the case of the low-energy gauge fields on the D3-brane, the relevant classical 
configuration is simply the charge-A; Yang-Mills instanton specified by the ADHM data which appears 
explicitly in the action (10.70a)-(10.70c). Note that the collective coordinate integral (10.68) depends 
explicitly on the string length-scale a 1 only via the zero dimensional remnant of the kinetic terms of 
the vector multiplet. As a consequence correlation functions which include fields inserted at distinct 
spacetime points and x^ will have a non-trivial expansion in powers of y/a!/\x^ — x^\. In order 
to decouple the world-volume gauge theory from gravity we must take the limit a' — > keeping the 
four-dimensional coupling g fixed. In this limit, the coupling of Sk + Sd is held fixed while the remnant 
of the kinetic term of the vector multiplet, Sq, decouples from the action. In this limit, the collective 
coordinate integral (10.68) becomes 

Z k = } — [ d 6 x d 8 \d 3 D d 4 a' d 8 M' d 4 w d 4 fid 4 ft exp (-S K - S D ) . (10.71) 
VoIU(k) J 

We can now make contact with the N = 4 instanton calculus. Note that Sk can be written more 
compactly as 

47r 2 2n 2 - 

S K = ~^tr k XaLxa + ^S aAB tr fe M A M B X a ■ (10.72) 
T T 

We now recognize the partition function (10.71) as being the instanton partition function in its linearized 
form (6.90) with Sk + Sd being the instanton effective action S in (6.91). 

We have therefore recovered the leading-order semi-classical expression for the collective coordinate 
integral for instantons in the M = 4 supersymmetric theory. In addition the action Sk + Sd is invariant 
under eight supercharges which are inherited from the M = (0, 1) theory in six dimensions. One can 
check that after Wick rotation and dimensional reduction, the supersymmetry transformations (10.64a) 
and (10.65) (up to appropriate re-scaling by g) match the collective coordinate supersymmetries of the 
Af = 4 ADHM instanton calculus written down in (4.68) and (4.70), or (6.95a)-(6.95d). There are also 
eight additional fermionic symmetries, corresponding to £ A in (4.70), which only appear after taking 
the decoupling limit a' — > 0. These correspond to the half of the superconformal transformations of the 
J\f = 4 theory which leave the instanton invariant. 

One can ask how constrained instantons appear in this context. Constrained instantons arise 
when the M = 4 theory is on its Coulomb branch realized by separating the N D3-branes in the 
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six-dimensional transverse space. This obviously changes the lengths of string stretched between the 
D-instantons and D3-branes and has the effect of adding mass terms for the fundamental hypermulti- 
plet fields. At the level of the matrix theory action (10.47) the relevant effect can be introduced by the 
replacements (6.93). Taking the decoupling limit and integrating out the vector multiplet one recovers 
the instanton effective action (5.25) for Af = 4 constrained instantons. 

X.3.2 Probing the stringy instanton 

We have seen in the preceding section that many of the features of the instanton calculus are reproduced 
by considering the dynamics of Dp-branes lying inside D(p + 4)-branes. It is interesting to ask whether 
the actual ADHM form for the gauge potential (2.49) can be obtained in this string theory context. 
The answer is affirmative once an appropriate probe is identified. The concept of a probe which is able 
to "feel" the instanton background was first described by Witten [158] in the context of Type I string 
theory. The idea was generalized to the Type II theories needed for the present discussion by Douglas 
in [156]. The appropriate probe turns out to be a D-brane as well. In the context of the D5/D9-brane 
system described in the last section, the probe is a single Dl-brane (or D-string) whose world-volume 
lies parallel to the other branes. The D-string "feels" the D5/D9-brane background since the fields of 
the D-string world-sheet theory include a subset that arise from open strings stretched between the 
D-string and the other higher- dimensional branes. The whole configuration is like a Russian doll: the 
D5/D9 and D1/D5 sub-systems are both examples of our general Dp/D(p + 4) system described in 
§X.3.1. In the composite system, the ADHM variables appear both as fields of the D5-branes and as 
couplings of the world-sheet theory of the D-string. The conditions for the resulting world-sheet theory 
to be Af = (0,4) supersymmetric are precisely the ADHM constraints (2.65). 

In more detail, we now analyse the dynamics of a D-string in the background of k D5-branes and 
iV D9-branes. The system breaks the ten-dimensional Lorentz group to SO(l, 1) x SO(4)! x SO(4) 2 . 
Here, SO(4)i describes the directions transverse to the D5-branes denoted by vector index n — 1, 2, 3, 4. 
From earlier, SO(4)i ~ SU(2)l x SU(2)#, where the latter correspond to the spinor indices a and 
a. The other factors SO(l, 1) x SO(4) 2 are a subgroup SO(5, 1), the Lorentz group of the D5-branes' 
world-volume. We will denote the corresponding spinor indices of SO(4)2 — SU(2)a x SU(2)b by 5 and 
5, respectively. Under the decomposition SO(5, 1) D SO(l, 1) x SO(4) 2 

4 - (2, l)x + (1, 2)_! , 4 - (2, 1)_! + (1, 2)x , 6 - (2, 2) + (1, 1) 2 + (1, 1)_ 2 . (10.73) 

Here, the subscripts indicate the SO(l, 1) chirality. 

Let us ignore, for the moment, the presence of the D9-branes. The D-string/D5-brane system is an 
example of the Dp/D(p + 4)-brane system studied in the last section. Recall that the D-string world- 
sheet theory can best be derived as the dimensional reduction of the D5/D9-brane system, with one 
D5-brane and k D9-branes. Obviously, we will have to use a different notation to describe the fields 
of the D-string theory as well as re-assigning indices appropriately. The action of the theory is the 
dimensional reduction to two dimensions of (10.62) with the following replacements. Firstly, there is a 
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U(l) gauge field and fermions 

Xa ^(A ± ,x n ), n^iCiCi) (10.74) 

where A± are the light-cone components of the two-dimensional abelian gauge field. Note that x n 
denotes, in this context, a field in the world-sheet theory rather than a spacetime coordinate. Since the 
world-sheet theory is abelian the adjoint hypermultiplet previously denoted {a' n ,A4'^} decouples and 
we can safely ignore it. Finally, the fundamental hypermultiplets describing string stretched between 
the D-string and D5-branes are described by the replacements 

wma - 4>is , ^ - Of) , < -> 4% , fi A - (C 0?) ■ (10.75) 

The world-sheet theory describing the D-string in the presence of the D5-branes has conventional kinetic 
terms along with a potential 80 

3 
c=l 

Here, the second term comes from the D-terms which can be viewed as the ADHM constraint for the 
D-string viewed as a single instanton inside the U(k) gauge theory on the D5-branes. There are also 
Yukawa couplings 

£ Yuk = e?x aa e? + e^ a e ia + + c;^)o,, + (ge? + m )h ■ (10.77) 

The first two terms are the fermionic ADHM constraints for the Grassmann collective coordinates of 
the single U(k) instanton. 

At the moment we have assumed all the D5-branes are coincident. In reality the D5-branes will be 
separated (in addition to being "thickened out") in a way described by the ADHM matrices a! n . By 
translational symmetry, the effect can be introduced into the world-sheet theory by replacing x n by the 
matrix quantity 

X n > 3?nl[fc]x[fc] + a n ■ (10.78) 

In this way the ADHM variables a! n appear as couplings in the world-sheet theory. 

Now we must consider the effect of the D9-branes. First of all, there are new fields in the world-sheet 
theory describing open strings stretched between the D-string and the D9-branes. Since there are no 
additional moduli and the new fields are fermionic, e u and e u . The second effect of the D9-branes is 
that in the configuration we are interested in, they absorb the D5-branes and the latter thicken out in 
a way parameterized by the ADHM variables Wa- These variables will, like a' n , appear as couplings in 
the world-sheet theory. Taking into account the D9-branes leads to a modified potential which has a 
very suggestive form: 

3 

£ P ot = -0fAf A^. 4 + {^etihs) 2 ■ (10-79) 

c=l 

80 In the following expressions we do not keep careful track of the coupling constants and normalizations. 
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Here, A and A are the ADHM quantities introduced in §11.4 (with the canonical choice (2.57)) but 
where now x n is viewed as a field rather than as a spacetime coordinate. Note (10.79) subsumes (10.76). 
The Yukawa coupling have a similarly suggestive form: 

£ Yuk = r*A Mde e? + id Af t a + {tie? + cief)^ + &? + m)h , (10.80) 

where we have defined the composites 81 

T A = ( € e u ) , T A = (e u 9?) . (10.81) 



Note (10.80) subsumes (10.77). 



As is often the case, the conditions that the world-sheet theory is A~ = (0, 4) invariant is identical 
to the equations-of-motion in the target space: in this case the D-flatness conditions in the D5-brane 
theory. In other words, as argued in [158], the conditions for extended Af = (0,4) supersymmetry are 
the ADHM constraints (2.65) on {w A , a' n } viewed as coupling in the world-sheet theory. 

For generic ADHM data, the potential (10.79) gives a mass to the fields 0. These fields can then 
be integrated out leading to a (gauged) cr-model. At the classical level, we can simply set = 0. The 
fields x n are massless and the resulting a-model involves the flat metric on M 4 . The fermionic sector is 
more interesting. The first two Yukawa couplings in (10.80) give masses to 9f , Qia and a 2 k- dimensional 
subspace of both T A and Y A . What we are interested in are the remaining massless fermionic degrees-of- 
freedom. To order to identify these modes we need bases for the null spaces of A and A. But these are 
provided by the ADHM quantities U and U, respectively, as is apparent in (2.47). Hence, the massless 
modes {p u ,p u } are picked out by 

T A = VxuPu , T A = pJJl . (10.82) 
The kinetic term for the massless modes then follows by substitution into the kinetic term for T: 

Y A <9_ Y A — p u (5 uv d. + (d.x n )(A n ) uv ) Pv , (10.83) 

with an 

(A n ) uv (x) = U^-^- . (10.84) 

This is precisely the ADHM expression for the gauge potential (2.49) as a function of the field x n (with 
g — > 1). Of course the expression that we have derived for the gauge potential is only valid in the 
classical limit and there will be stringy corrections to the instanton profile on a scale set by \fa' . 



X.3.3 The M = 2 instanton calculus 



There are a number of ways to obtain four-dimensional Af = 2 theories as the collective dynamics of 
branes in string theory. The first observation is that the Af = 2 theory has 8 supercharges and may be 

81 Recall, from §11.4, A is the ADHM composite index u + ia. 



178 



obtained from the dimensional reduction of a six-dimensional theory with Af = (0, 1) supersymmetry. 
Just as in the case of the Af = 4 theory it is useful to realize the Af = 2 theory in its maximal dimension. 
In this case, an instanton will be a 1-brane soliton as described in §X.2. 



One way to realize the six-dimensional theory is to consider the collective dynamics of D5-branes 
embedded in a spacetime where the four transverse dimensions are orbifold IR 4 /Z 2 . The collective dy- 
namics of the D5-branes can be deduced in the following way [171-173]. First of all, suppose we start 
with a set of N parallel D5-branes in flat ten-dimensional Minkowski space. From the discussion in 
§X.3.1, the world- volume theory is simply the dimensional reduction to six of Af = 1 supersymmetric 
U(iV) gauge theory in ten dimensions. This theory has 16 supercharges, i.e. has Af = (1,1) supersym- 
metry in six dimensions. The theory has an SU(2),4 x SU(2) B i?-symmetry which occurs because of the 
SO (4) rotational symmetry of the transverse space. Now we replace the transverse space with the orb- 
ifold R 4 /Z 2 , where the Z 2 is chosen to act as the centre of SU(2)a- The resulting world-volume theory 
of the D5-branes is deduced by a process of projection in the following way. Notice that the or = Z 2 
acts on the fields via their i?-symmetry indices. The next part of the procedure involves embedding 
the Z 2 group action as cr^^ in the gauge group. There are different ways to do this. However, up to 
conjugation we can take 



with N = N + M. The resulting theory is then obtained by taking the fields and action of the 
original Af = (1,1) theory and projecting out by hand all the fields which are not invariant under 
the simultaneous transformation by <7rctu(jv)- The resulting theory has gauge group U(iV) x U(M), 
the supersymmetry is reduced to Af = (0, 1) and there are two hypermultiplets in the bi-fundamental 
representation of the gauge group, i.e. each having fields in the (JV, M) + (N, M). 

Now we consider instantons. In the six-dimensional theory the instantons correspond to Dl-branes 
lying inside the D5-brane, the same situation as in §X.3.1, but now lying transverse to the orbifold 
rather than M 4 . The collective dynamics of the Dl-branes in this configuration can be deduced from 
the Dp/D(p + 4)-brane system discussed in §X.3.1 (with N D(p + 4)-branes) and then performing the 
same kind of projection on the D 1-brane world-sheet theory that we did for the D5-brane world- volume 
theory above. Firstly, one embeds Z 2 both in the i?-symmetry group and in the gauge group, but now 
the gauge group is pertains to the Dl-branes. Firstly the i?-symmetry. In terms of the variables of the 
instanton calculus, the Z 2 is embedded as <xr in the group SO(6) ~ SU(4) that arose from the world- 
volume Lorentz symmetry of the D5-branes in the D5/D9-brane system. On dimensional reduction to 
the Dl/D5-brane system SO(6) is broken to SO(l, 1) x SU^)^ x SU(2) B and a R acts as the centre of 
SU(2),4. For instance, Xa which transforms in the vector of SO(6), breaks up into 



Here, the invariant components are the two-dimensional U(/c) gauge field on the Dl-brane world-sheet. 




(10.85) 




(10.86) 



For spinor quantities like if a and A4 



we have 




(10.87) 
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and similarly for fx A and fi A . The embedding Z 2 in the U(fc) gauge group of the Dl-brane world-volume 
theory determines the instanton charge of the configuration. For instance 

*u<*) = ( hk T l] i ) , (10.88) 

with fc = ki + k 2 describes an instanton configuration with charges k\ and k 2 with respect to the U(iV) 
and U(M) factors of the gauge group. In order to have an instanton which lives solely in the U(iV) 
factor we must set k 2 = so that k\ = k and in this case cu(fc) = I^xm- From the point-of-view of this 
instanton configuration the U(M) gauge group plays the role of a spectator since, to leading order, the 
U(M)-adjoint fields are zero and the U(M) symmetry is effectively a global symmetry. Consequently on 
dimensional reduction of the D5-branes to four dimensions, the resulting theory is effectively an M = 2 
theory with Np = 2M fundamental hypermultiplets. 82 

The world- volume theory on the Dl-branes is then obtained from the action of the D5-branes in 
the D5/D9-brane system, that is (10.56), dimensionally reduced to two dimensions and by removing by 
hand any fields that are not invariant under a simultaneous transformation o r RO"u(fc)0" U (jv)- The latter 
transformation has to be included because the fundamental hypermultiplets carry U(iV) gauge indices. 
In our case, recall that o"u(fe) — l[k]x[k]- The fields that remain are the following. Firstly, from (10.86), 
only the components Xa, a = 0, 1 remain. This is a U(fc) gauge field in two dimensions. For the other 
adjoint- valued fields, the bosonic quantities a' n survive, since these are invariant under <tr. For the 
fermions only the components ipA and -M /j4 , with A = 1, 2 remain. The situation is slightly more subtle 
for the fundamental hypermultiplets. Each index u can now run over N + M values which split into 
two set: u — 1, . . . , N and u' — 1, . . . , M. Then, for example, 

a V(N)( w uia) — Wuia , &V(N)( W u'ia) = ~ w u'ia (10.89) 

and similarly for the other hypermultiplet fields. Therefore, the fields w ui a survive along with //^ and 
jif u , for A = 1,2. However, in addition the fermionic fields \i A ti and [if u ,, with A = 3,4, are odd with 
respect to both or and cr v ^ and so survive. 

We now show that on dimensional reduction to zero dimensions and in the decoupling limit, the 
partition function of the resulting matrix theory is the leading order expression for the collective coor- 
dinate integral of the M = 2 theory with Np = 2M hypermultiplets. The details are almost identical 

— * 

to the N — 4 case described in §X.3.1. In particular, as previously, in the decoupling limit, the D 
and ipA (A = 1, 2) become Lagrange multipliers for the bosonic and fermionic ADHM constraints. The 
main difference is that the the dimensionally reduced (Wick rotated) action (10.72) is subject to the 
projection described above yielding 83 

4-77-2 , _ _ _ _ "I 

S K = — |tr fc Xa L Xa -\{xx- 1X2) [M\Ml - M 2 u M l u ) - |(xi + tx 2 ) {M 3 U ,M 4 U , - M 4 U ,M 3 U ,) } . (10.90) 

In this expression a sum over a = 1, 2 is implied. The integral over Xa yields the factor of |det fc 2 2>| _1 in 
(5.55) for Af = 2. While the action becomes 

S = ~ix k {MlMl-MlMl)L-\Ml,Mt.-MiMl,) . (10.91) 
9 



82 Note only Nf even theories can be obtained in this way. 

83 It is useful to recall that in Euclidean space Si = rf and S2 = iff 
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This is precisely equal to (6.84) after a suitable re-scaling by g, with the VEVs set to zero and with the 
relations 

JC f = (Ml,, M 4 U ,) , K f = i(M 3 u „ -Mi,) , (10.92) 

where / = 1, . . . , N F = 2M. Just as in the Af = 4 case, the instanton effective action on the Coulomb 
branch of the Af = 2 theory can easily be obtained by separating the D5-branes in the two dimensions 
transverse to their world-volume orthogonal to the orbifold. 

X.3.4 Mass couplings and soft supersymmetry breaking 

In §VI.3, we described how the instanton calculus was modified when mass terms were added to the 
field theory breaking supersymmetry successively from Af = 4, through Af = 2 and Af = 1 to Af = 
0. Using the response of the instanton to field theory masses, we were able to relate in §V.4 the 
collective coordinate integrals with different numbers of supersymmetries by a process of decoupling 
and renormalization group matching. In this section, we describe how the effects of mass terms may be 
realized in the brane description. 

First of all, let us establish at the level of the M = 4 field theory, the form of the mass deformation. 
It is useful at this stage to introduce the language of Af = 1 superfields. The Af = 4 theory consists of 
an Af = 1 vector multiplet along with three adjoint- valued chiral multiplets $; = {(pi/y/2, \f}, i = 1, 2, 3. 
The relation between these fields and the fields that we introduced in §IV.l can be chosen as 

Vl = ^05 + 06 , V?2 = i4>3 + 04 , V?3 = 201 + 02 (10.93) 

and 

Ai = A A , A ; = A A A = i=l,2,3. (10.94) 
With this choice the vector multiplet contains {A m , A = A 4 }. 

In terms of Af = 1 superfields the action of the Af = 4 theory (in Minkowski space) can be written 
S=±J d t zti N {£lmTW°W a \^ + 2*\e v ^+ . (10.95) 

The most general mass deformation that preserves Af = 1 supersymmetry can be obtained by adding 

Smass = ^ J dxHr N [ mi ^\ 62 +m*$[ 2 |_ 2 } . (10.96) 
In terms of component fields (in Euclidean space) 

S_ = 1 / ^{m^K - mT^W + l»,l 2 l«l 2 + >».A,A, + . (10.97) 

It is interesting to establish the SO (6) R-symmetry properties of the mass terms above. In general a 
mass term for the fermions can be written 

J d 4 xtr N {m AB \ A \ B + m AB \ A \ B } (10.98) 
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where the symmetric matrix m AB (m AB = m* AB ) transforms in the 10 (10) of SO(6). The mass matrix 
can be chosen to be diagonal: ttiab = diag(mi, m,2, m 3 , m 4 ). In order to preserve at least M = 1 
supersymmetry, at least one of the mass eigenvalues must vanish. Choosing m 4 = gives the fermion 
mass terms in (10.97). The mass matrix can also be written as a anti-symmetric rank-3 tensor T abc of 
SO (6). The relation between the two bases is provided by the E-matrices: 

T abc ~ m AB (S [a S 6 S c] ) AB + m AB (%Z b Z c] ) AB . (10.99) 

With some choice of normalization, the tensor T is associated to the following cubic coupling of the 
scalar fields: 

T abc tr N (p a <Pb<Pc = eijktrAr {TOi</?i</?}</4 - TO*v?i Vj^k + m 4 (pi(pjLp k - ra^V}^} . (10.100) 

When rri4 = 0, this is precisely the bosonic part linear in the masses of the Af = 1 preserving deformation 
in (10.97). 

The question before us is how to introduce the mass deformation when the M = 4 theory is realized 
as the the collective dynamics of N coincident D3-branes in Type IIB string theory? A Dp-brane 
carries charge which couples directly to the Ramond-Ramond p + 1-form potential C^ p+1 \ However, 
Dp-branes also couple to other background fields in the string theory. The most general couplings of 
the Ramond-Ramond potentials to a collection of Dp-branes occurs through the Chern-Simons action 
whose form, in the case of multiple branes, was established by Myers [174]. In particular, D3-branes 
can carry D5-brane dipole moment which is induced by a coupling to the potential of the form 84 

j 2 j d 4 xtr N {0 a 6 c£ 3a6 (0)} . (10.101) 

Consider the case where the associated 7-form field strength = dC^ is constant. Then the coupling 
above is simply equal to 

^ J d A xF^ 2Zabc ti N <p a <p h <p c = ^- 2 J d A x(* & F^) ab ti N <p a <p h <p c , (10.102) 

using F^ = *F^\ Here, * 6 is Hodge duality in the six- dimensional transverse space. Comparison with 
the mass deformation (10.97) shows that the background Ramond-Ramond field produces the bosonic 
terms linear in the masses if 

{HF^) abc = QT abc (10.103) 

and T is the tensor defined in (10.99). 

In this way we have reproduced the bosonic mass coupling (10.97) using a suitable background 
Ramond-Ramond potential. However, as one would expect due to supersymmetry, the background field 
also couples to the fermions. The analogous example of DO-branes coupling to has been investigated 
in [175]. By using T-duality we can extract the analogous coupling for D3-branes. The result is most 
easily written by recalling that the fermionic fields on D3-branes can be obtained by a dimensional 
reduction of a ten-dimensional Majorana-Weyl spinor. Using the representation of the Clifford algebra 
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In the following 0123 refers to the world-volume directions. 
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as in (10.11) and the decomposition of \& in terms of {\ A , \ A } in (10.13), the coupling to the background 
potential can be written in the form 

j 2 J d 4 xF«l 3at)C ti N <i>r a r b r c y . (10.104) 

Using (10.99) and (10.103) this can be written as 

J d 4 xtr N {m AB \ A \ B + m AB \ A \ B } , (10.105) 

precisely the fermion mass coupling in (10.97). 

Now that we have established how to introduce the mass coupling in the world-volume theory of the 
D3-branes, we can now consider the effect of the same background field in the D-instanton theory. Recall 
that the relevant effect for the D3-branes could be described as a D5-brane dipole moment coupling 
to C^. But non-trivial also implies a non-trivial background since dC^ = *dC^\ A D- 
instanton will carry Dl-dipole moment and so a coupling to the background field. The couplings 
can be deduced from the D3-C^ coupling established above by successive T-dualities reducing the 
D3-branes to D-instantons. The leads to the following couplings in the D-instanton action: 

^f^M xaxtxc + V^tr fc yr a r b r c y , (10.106) 

where \1/ is the 16-component fermion defined in (10.55) and T a are components of the ten-dimensional 
T-matrices (10.11) in the six-dimensional space transverse to the D3-branes. The field strength can be 
deduced by taking the dual of (10.103): 

F (3) = 6T , (10.107) 
where T is equal to the form of T with itiab replaced by — ttiab- The couplings (10.106) are then simply 

2 

^tr fc { - m AB M' A M' B + {A-Ka') 2 (2f abcX aXbXc + m AB ^ A ^ B ) } . (10.108) 

Notice that the couplings involving the vector multiplet {Xa^A} do not survive in the decoupling limit 
a' — > (fixed g). In this limit, the only surviving term is the mass coupling for A4' A which is identical 
to that in (6.89) (after the re-scaling M A — > g 1 ^ 2 M A ). In order to completely reproduce (6.89), the 
hypermultiplet fermions {fj, A , ft A }, arising from open string stretched between the D-instantons and D3- 
branes must also couple to in an analogous way dictated by their SO (6) transformation properties. 
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XI Further Directions 



What we have said in the preceding Chapters is far from the end of the story of the calculus of many 
instantons. There are two main developments that we turn to in this final Chapter. The first involves 
the fate of instantons in a rather esoteric generalization of the underlying gauge theory, or rather on 
the spacetime on which it is defined. The idea is to define the theory on a non-commutative version of 
R 4 . This non-commutative space is characterized by the fact that the spacetime position coordinates 
x m no longer commute. The motivation for consider such a generalization comes from string theory. 
We have already described how gauge theories can arise as the collective dynamics of D-branes in string 
theory and turning on certain background fields can make the resulting world-volume non-commutative. 
What is remarkable is that the non-commutativity of spacetime does little violence to instantons. In 
particular the moduli space is still a commutative space. In fact it is a very simple deformation of $Jlk 
obtained by taking non-zero central terms for the U(l) C U(/c) factor in the hyper-Kahler quotient 
construction. In particular the deformed space, which we denote 9Jtj? is still hyper-Kahler and rather 
remarkably it is a smooth resolution of the original space: instantons can no longer shrink down to zero 
size due to the non-commutativity. In addition, instantons now become non-trivial in theories with an 
abelian gauge group like QED. 

The second development we describe in §XI.2 is a new way to calculate instanton effects beyond 
a single instanton, or two for SU(2), when scalar fields have VEVs. As we have seen, one must use 
the constrained instanton formalism of Affleck. To leading order in the semi-classical approximation, 
we have seen that the effect of the VEVs is to turn on a non-trivial instanton effective action which 
acts as a potential on the instanton moduli space. We will show in the context of the calculation of 
the prepotential in M = 2 theories that the resulting integrals over the instanton moduli space have 
the remarkable property that they localize around the critical points of the instanton effective action. 
This is guaranteed by the existence of a nilpotent fermionic symmetry, or BRST operator [176]. In fact 
the critical points correspond to configurations where all the instantons shrink down to zero size. The 
singular nature of this configuration may be regularized by making the spacetime non-commutative 
since this removes the singularities of the instanton moduli space. Using this technology we will be 
able to rather swiftly, following [95], re-derive the one-instanton contribution to the prepotential in 
SU(iV) that we calculated in §VIII.4 and then extend the method to two instantons. Very recently this 
technique of localization has for the first time enabled a calculation of instanton effects to all orders 
in the instanton charge for finite N [96]. The relevant theory is the Af — 4 theory softly broken to 
M = 2 by giving mass m to two of the three adjoint- valued chiral multiplets and the physical quantity 
that can be calculated to all orders in the instanton expansion are the 0(m 4: ) terms in the prepotential. 
Clearly the new technique of localization looks very promising and we refer the reader to [95, 96] for 
more details. 
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XI. 1 Non- commutative gauge theories and instantons 



It is not at all obvious but the rather bizarre generalization of gauge theories defined on a non- 
commutative background spacetime has a rather pleasant and simple effect on instantons. Non- 
commutativity is also of special interest in the framework of string theory and D-branes [177-180] 
(see Ref. [181] for a review). Here we will briefly examine non-commutativity from the instanton per- 
spective. 

We have already seen in §11.6 that the moduli space DJlk of instantons in ordinary commutative gauge 
theories fails to be a smooth manifold due to conical singularities arising when the auxiliary symmetry 
group U(fc) does not act freely. Physically these are points where individual instantons shrink to zero 
size, i.e. w iU a = for a given i e {1, ...,k}. There is a natural way to resolve, or blow up, the 
singularities of DJlk — > 9Jtjp preserving the hyper-Kahler structure described by Nakajima [182,183]. 
The important observation is that the quotient group U(fc) has an abelian factor and, as we explain 
in Appendix B, one has the freedom to add to each of the three moment maps a constant term in 
the Lie algebra of any abelian factor. In the context of the ADHM construction this freedom involves 
modifying the ADHM constraints (2.59a) by adding a term proportional to the identity k x k matrix 
to the right-hand side as in (2.70): 

T** j3 oPO'a — C(+)l[fc]x[fc] • (11-1) 

Now consider the clustering limit in which the i th instanton is well separated from the remaining. In 
this limit, the effective ADHM constraints of the single instanton are modified from (2.102) to 

where we recall that pi is a measure of the size of the instanton. Without-loss-of-generality, suppose 
oc= (0, 0, 1), then it is easy to see that 

P 2 i>\\C M \ ■ (H.3) 

Therefore in the presence of the central term, an instanton can no longer shrink to zero size and the 
singularity in the instanton moduli space corresponding to that process is smoothed out. For instance, 
the U(l) C U(fc) corresponding to phase rotation of w ui a for fixed i, no longer a fixed point. In fact, 
one can show that the whole of the auxiliary group U(fc) no longer has any fixed points on SDtjP. 

The question is what physical effect can introduce the central term in the ADHM constraints. 
Remarkably, we will see that precisely this smoothed instanton moduli space 9Jtjp, defined by (11.1), 
arises in the ADHM construction of instantons in gauge theories formulated on a spacetime with non- 
commuting coordinates [184]: 

[x m , X ra ] i@mn • (11"^) 

— * 

The 3-vector is precisely the anti-self-dual projection of the non-commutativity parameter 9 mn . We 
also define the self-dual projection which appears in the ADHM constraints of anti-instantons: 

C ( C +) = fj'mJmn , QL) = tAn , C = 1, 2, 3 . (11.5) 

where rf mn and ff mn are 't Hooft ^-symbols (see Appendix A). 
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It is quite remarkable that non-commutative instantons are non-trivial even for an abelian U(l) 
gauge group. In this case, the deformed instanton moduli space nJtjj. is simply a resolution of the 
space Sym fe R 4 . 85 In fact, the case of an abelian gauge group, considered by Nekrasov and Schwarz in 
Ref. [184], provided the first examples of explicit non-commutative instanton configurations. Various 
aspects of the ADHM construction on non-commutative spaces were further developed and clarified in 
Refs. [185-191]. Explicit examples of instanton solutions in U(iV) gauge theories with space-space as 
well as space-time non-commutativity were constructed and analysed in [190]. In general, for \(\ > 0, the 
moduli spaces of these solutions contain no singularities. However, for semi-classical functional integral 
applications it is important to ensure that the instanton gauge field itself is also non-singular — or to be 
more precise is gauge equivalent to a non-singular configuration — for all values of the argument [190]. 
The supersymmetric collective-coordinate measure for non-commutative instantons was constructed 
and the one-instanton partition function calculated in Ref. [176]. Based on these results, instanton 
contributions to the prepotential of the non- commutative M = 2 pure gauge theory were determined 
in [192] and found to be equivalent to the corresponding commutative contributions confirming the 
hypothesis that the non-commutative version of the theory is described by the same Seiberg-Witten 
curve [192,193]. 



XI. 1.1 ADHM construction on non-commutative R 4 



We will work in flat Euclidean spacetime R 4 with non-commutative coordinates x m which satisfy the 
commutation relations (11.4) where 9 mn is an antisymmetric real constant matrix. Using Euclidean 
spacetime rotations, 9 mn can be always brought to the form 

/ 12 






Or, 













o\ 



#34 

0/ 



11.6) 



In terms of complex coordinates 

zi= x 2 + ixi , zi — x 2 — ixi , 

z 2 = x 4 + ix 3 , z 2 = x A - ix 3 , 

the commutation relations (11.4) take the form 

[zi,zi] = -26»i2 , [zi,Zj] = , 

[Z2, Z 2 ] = -26*34 , [Zi, Zj^i] = , 

where i,j = 1,2. Besides the usual commutative case, there are two important cases to consider: 



(11.7) 



11.8) 



1. When either £? 12 or 6*34 vanishes, the matrix 6 mn is of rank-two. This case corresponds to the direct 
product of the ordinary commutative 2-dimensional space with the non-commutative 2-dimensional 

85 This is the fc-fold symmetric product of R 4 which arises from solving the ADHM constraints (2.65) for N = 1. The 
solution is Wia = and a' n — — diag(X^, . . . ,X%), X % <G R 4 , which fixes all the U(fc) symmetry apart from permutations 
of the positions X\. Modding out by the permutations gives the fc-fold symmetric product of M 4 . This space is singular 
whenever two, or more, of the point-like instantons come together, since then the group of permutations does not act 
freely. 
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space, M^c x For definiteness we set here 6> 34 = and introduce the notation 9 12 = — C/2 in such a 
way that 

[ Zl ,Z 1 ] = -(, [32,32] =0, [Zi,Zj\=U. (11.9) 

Physical applications of this situation involve gauge theories defined on a background with non-commutative 
space but commutative time. 

2. A rank-four matrix 9 mn (with 612 7^ and #34 7^ 0) generates the non-commutative Euclidean 
spacetime = Rj^ c x ffi^c- The corresponding world-volume gauge theory has non-commutative 
(Euclidean) time. Since both components of 9 are non-vanishing, they can be made equal, 9i 2 = 634 = 
— C/4, with appropriate re-scalings of the four coordinates x m and, if necessary, a parity transformation. 
Equations (11.8) become 

c 

[zi,Zj] = --5ij , [zi,Zj] = . (11.10) 
In fact, the conditions Q\ 2 = ±#34 corresponds to (anti-) self-duality: \e mnkl 9 kl = ±9 mn . 

A Hilbert space representation for the non-commutative geometry (11.9) or (11.10) can be easily 
constructed by using complex variables (11.7) and realizing z and z as creation and annihilation oper- 
ators in the Fock space for simple harmonic oscillators (SHO). The fields in a non-commutative gauge 
theory are described by functions of z 1 ,z 1 ,z 2 ,z 2 - In the case of R^, c x M 2 , the arguments z 2 and z 2 are 
ordinary c-number coordinates, while z\ and Z\ are the creation and annihilation operators of a single 
SHO: 

z i \ n ) — VC Vn + l\n) , z\ \n) = a/C \fn \n — 1) . (H-H) 

The non-commutative spacetime M^c = ^nc x ^nc requires two oscillators. The SHO Fock space H is 
spanned by the basis 1 711,712) with 711,712 > 0: 



zi \n u n 2 ) = )J~^ Vn~i + 1 |ni + 1, n 2 ) , z 2 |rai, n 2 ) = \ \Jn 2 + 1 K, n 2 + 1) , 
^iK^Hyf v^K-W , 3 2 |ni,n 2 ) = l 72 !)^ - 1) • 



(11.12) 



The integral on IR^c i s defined by the operator trace, 

J d A x * = (2n) 2 Vfet9 Tr * = (^j 2 Tr * . (11.13) 



The ADHM construction of the instanton now proceeds exactly as in Section §11.4 but one has to 
keep an eye on the ordering of operators. We assume the canonical form (2.57), (2.58) for the ADHM 
matrices and analyse the requirements imposed by the factorization condition (2.50). They amount to 
the modified ADHM constraints (cf. (2.65)) 

T^paPcta = C(+)hk]x[k] ■ (H- 14 ) 

The three conditions (11.14) are the modified ADHM constraints for the instanton. When = 
Eqs. (11.14) give the standard commutative ADHM constraints (2.59a). When non-commutativity is 
present, the constraints are modified by the anti-self-dual component of 9. Thus, the ADHM constraints 
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for the instanton in a self-dual-^ background on M^ c are equal to those of commutative M 4 . However, 
the constraints for the instanton in non-commutative space ]R^ C x R 2 are always modified since 9 cannot 
be self-dual. The constraints are 

r c ^A d = <r 3 ci WxW . (ii.i5) 

The constraints for an anti-instanton follow from solving the same factorization condition (2.50) with 
the matrix A = a + bx. In this case the ADHM constraints are modified by the self-dual component of 
9: 

T a^da = C(-)l[fc]x[fc] • (11.16) 

The explicit one-instanton and one-anti-instanton solutions for U(iV) gauge theory were constructed 
in [190] for space-space, x R 2 , and spacetime, IR^c non-commutativity by resolving the corre- 
sponding (modified) ADHM constraints and solving the completeness relation (2.51). As always, one 
has to distinguish between two types of singularities: the singularities of the instanton field A m (x) as 
the function of the argument, and the singularities arising for certain values of the collective coordinates 
interpreted as singularities on the moduli space. Instanton configurations can be determined in singular 
or regular gauges; in singular gauge all the singularities of instanton configurations as functions of x m 
are simply gauge artifacts and can be gauged away. 

Instantons with space-space non-commutativity arise from (11.15) where ( ^ and their moduli 
space contains no singularities. Instantons with spacetime non-commutativity contain no singularities 
on the moduli space unless the the self-duality of 9 coincides with the self-duality of the instanton 
field-strength such that the ADHM constraints (11.14) and (11.16) collapse to (2.59a). 

XI. 1.2 The prepotential of non-commutative Af = 2 gauge theory 

We can now consider the Af = 2 supersymmetric U(iV) gauge theory formulated in non-commutative 
space. On the Coulomb branch, the gauge group is broken to U(l) c x U(l) Ar_1 , where U(l) c is the 
overall U(l) factor of the non-commutative U(A r ) gauge group which decouples in the infrared due to 
the infra-red/ultra- violet mixing as explained in Ref. [192]. Our goal is to determine the low-energy 
dynamics of the U(l) iV_1 factor. 

In a similar way to the ordinary commutative case reviewed in §VIII, the corresponding low-energy 
effective action is determined by the Seiberg-Witten prepotential T as explained in Refs. [192,193]. In 
particular, note there are no commutative "star-products" in the low energy effective action since we 
are concerned only with the leading-order terms in the derivative expansion of the effective action. As 
in the commutative theory, instantons and anti-instantons contribute to various correlation functions. 
By considering the instanton contributions to these correlation functions one can relate the instanton 
coefficients of the prepotential to the centred instanton partition function. We shall assume that the 
re-scalings (8.22) have been performed and so the relation is identical to (8.24). Although we did 
not consider the anti-instanton contributions in Chapter VIII, one can easily derive an anti-instanton 
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version of (8.24) which involves the complex conjugate of the prepotential. Summarizing, we have for 
k > 

Here, Z ( ± k ~ 2,NF) are the instanton and anti-instanton centred instanton partition functions of the non- 
commutative theory and defined as integrals over the moduli spaces %Jl±l'- 

where, the supersymmetric volume form over the resolved centred instanton moduli space is explicitly 



f u w = 2^-2^2(^-1) g fc / rf 4fc (^+ fe )- 4 a f[ d 2 *^*)" 2 ^ Idet^Xl 1 -^ 

M« Voiu(fc)y f 1 i 

(11.19) 

k 2 , 3 AT 2 >. V 7 



r=l c=l A=l »=1 y 

The expression is identical to the expression for the supersymmetric volume form of DJlk defined in 
§VI.5 apart from the fact that the bosonic ADHM constraints are modified appropriately to include the 
central terms. 

From (11.19) we conclude that the centred instanton partition function can only depend on ( {+) 
while the centred anti-instanton function can only depend on Given (11.17), the aforementioned 
dependences are very restrictive: the prepotential cannot depend on ( and therefore should be identical 
to that in the commutative theory. This was the hypothesis that was made in Ref. [192] for the theory 
with no hypermultiplets. In the next section, we will develop a localization formalism in which we 
can prove rigorously that the centred (anti-)instanton partition function cannot depend smoothly on 

— # — # 

Q ±) . There are possible discontinuities when ( i±) = 0, since at this points the (anti-)instanton moduli 
space becomes singular and the localization formalism breaks down. However, we can already test 
this hypothesis at the one-instanton level by re-doing the calculation of §VIII.4 in the context of the 
non-commutative theory. It is simple to establish the modifications that arise from including the non- 
commutativity parameter (. The instanton effective action (8.49) has the additional coupling 

AttHD-Cm ■ (11.20) 

— * 

This modifies the D integral in (8.61) 

^ e -4^5.c (+) TT a u (n 21) 

D 2 ^ K| 4 + L> 2 

— * — * 

The angular integrals over D = (\D\,6,<f>) are now non-trivial: 

/ d{cose)d9e-^<^ = si *( 4 ^IK(+>l) . (1L22) 

J n\ D \\C(+)\ 

We then extend the integral over \D\ from — oo to +00 and perform it by standard contour integration 
yielding 

1^-1^1 N *2 

vrX n £ - e y K ' MK n ufe) ■ < 1L23 » 

u=l "« u =l u v=l I" 1 "' 
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compared with the right-hand side of (8.61). The behaviour of the modified integrand in the vicinity 
of one of the N singularities x — ~ 0«j i- e - a u = 0, is 



,a„ tt a 



2tt 2 — IT , m i „ +••• (11-24) 

rv 11 \rv 4 _ Lv 4 V / 



(AO 



and so unmodified from the commutative calculation. The contribution to the centred instanton par- 
tition function from the N singularities is therefore unchanged from (8.66). However, the contribution 
from the sphere at infinity in x-space is modified since in the non-commutative case the asymptotic 
behaviour of the integrand is changed. In fact contrary to (8.68) we now have 

-2iN6 



and consequently the contribution from the sphere at infinity vanishes because either the integrand 
fi(r,9)f2(re td ) falls off too fast, for Np < 2N, or due to a vanishing angular integral, for Np = 2N. 
Hence, in the non-commutative theory the result for the one-instanton coefficient of the prepotential is 

N N Np 

= e n n<™/ + <&) ■ ( n - 26 ) 

u=1 „=i \Yv YuJ f=1 
(*«) 

compared with (8.72) in the commutative case. 

Notice that the additional contribution denoted S[ Np> which we identified as arising form the vicinity 
of the singularity on the instanton moduli space, is missing from the non-commutative result, as might 
have been expected since the singularity of 9DTi has been resolved. At the one-instanton level, we see 
that the commutative and non-commutative results are equivalent up to an unimportant constant for 
N F < 2N. This is entirely consistent with the hypothesis that commutative and non-commutative 
theories have the same SU(iV) low-energy effective action. However, the finite theory with N F = 2N 
does have the apparently physically significant difference due to the term proportional to X^=i(^°) 2 that 
arises in the commutative theory (8.72) but which is missing from (11.26). This difference, however, 
is not physically relevant since it can be explained by a re-definition of the coupling between the 
commutative and non-commutative theories. This follows from the expression for the prepotential in 
(8.7). If the coupling in the non-commutative theory is related to that in the commutative theory by a 
series of instanton couplings: 

oo 

r nc = T + c k e 2mkT , (11.27) 

k=l 

then SU(iV) part of the the low-energy of the non-commutative theory is equal to that of the commu- 
tative theory. At the one-instanton level we have 

ci = — (11.28) 

1X1 

and 0:2 is the coefficient in (8.70). Matching the non-commutative and commutative theories requires 
for higher instanton number a relation of the form 

N 

^ = T k -2itid k Y J (€Y , (11-29) 
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up to irrelevant (mass- dependent) constants, where dk are determined in terms of the c^. It is difficult to 
prove this relation rigorously. However, in the next section we prove that JF^ C cannot depend smoothly 
on hence, the difference T^ c — Tk cannot depend on ( {+) . Then assuming that the contribution 
to the difference arises from the sphere at infinity in %-space, 86 generalizing the situation at the one- 
instanton level, the result can be argued to be polynomial in the masses and VEVs. Given that the 
prepotential has mass- dimension 2, this leaves (11.29) as the only possible VEV dependence. 

It should not have escaped the readers notice that something unexpected happens in the finite theory 

— * — * 

when the non-commutativity parameters are chosen so that either = or = 0. In the former 
case, the instantons are those of the non- commutative theory, described by the smooth moduli space 
9JtP, while the anti- instantons are conventional commutative ones, described by the singular space Wli. 
In this case, even at the one-instanton level (11.17) is violated since the anti-instantons receive the 
contribution <$( (Ar F- 2JV) from the singularity on the instanton moduli space, while the anti-instantons do 
not; hence 



In addition, a re-definition of the form (11.27) cannot reconcile the commutative and non-commutative 
theories. It is not clear how one should describe what is really happening at these points in the non- 
commutativity parameter space and whether the resulting behaviour of the low-energy effective action 
is physically acceptable. 

XI. 2 Calculating the prepotential by localization 

The major applications of the many instanton calculus that we have hitherto reported (the two- instanton 
contribution to the SU(2) prepotential in §VIII.3.1 excepted) have involved a large- N limit. Clearly 
one would like techniques to calculate instanton effects for any N and k. The problem is obvious: the 
integrals over the instanton moduli space are just too complicated to be done by brute force beyond 
instanton number one (or two in the special case of gauge group SU(2)). Even the integral over the 
one-instanton moduli space required to obtain the first instanton coefficient of the prepotential with 
SU(iV) gauge group, undertaken in §VIII.4 was far from elementary. 

However there is hope: there is interesting mathematical structure underlying the integrals over the 
instanton moduli space that define the instanton partition function and hence the instanton coefficients 
of the prepotential. They are related to integrals that give topological invariants; for example, in the 
M = 4 case the integral is precisely the Gauss-Bonnet-Chern integral that — at least on a compact 
manifold — gives the Euler characteristic. 87 When the theory is on the Coulomb branch instantons are 
constrained and there is a non-trivial potential on the instanton moduli space caused by the non-trivial 
instanton effective action. This acts as a Morse potential and the integrals localize around various 
subspaces of the instanton moduli space around which the quadratic approximation is exact. This kind 
of localization is a generalization of the Duistermaat-Heckman Integration Formula that arises in various 

86 This should follow from the localization technology developed in the next section. 
87 We do not discuss the mass-deformed TV = 4 to TV = 2 theory in this review: see [176]. 




(11.30) 
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supersymmetric contexts (see, for example, [194-198] and references therein). The obvious problem to 
using this theory is the fact that the instanton moduli space is non-compact and has singularities when 
instantons shrink to zero size. The problem with the singularities can be be alleviated by considering 
the resolution of the instanton moduli space SJtjp relevant to a non-commutative theory as described 
in §XI.l. It will turn out that the integrals over the instanton moduli space in the non-commutative 
version of the theory on the Higgs or Coulomb branch is amenable to a form of localization. This will 
allow us to calculate the two-instanton contribution to the prepotential in an SU(iV) (or rather U(iV) 
since in the non-commutative version the additional abelian factor is inevitably included) theory. In 
fact the localization method greatly simplifies the integrals even at the k > 2 level and we expect that 
further progress may be possible. 

In retrospect it should not be a surprise that the instanton partition function localizes in the non- 
commutative theory. We will show that the subspaces of SCtjP on which the instanton partition function 
localizes correspond to products of (non-commutative) U(l) moduli spaces Wl { P\ N=1 . In fact the lo- 
calization submanifolds are associated to each partition of k — > k\ + • • • + fcjy, k u > 0, and have the 
form 



where the quotient is by the overall centre of the instanton. This is a manifestation of the fact that in the 
non-commutative theory there are exact — no longer constrained — instanton solutions on the Coulomb 
branch. The reason is that the non-commutativity prevents instantons shrinking to zero size and there 
are now stable solutions of infinite size and the consequences of Derrick's theorem are avoided. The fact 
that these solutions are exact even in the presence of a VEV follows because the non-trivial fields only 
take values in the 11(1)^ subgroup of the U(iV) gauge group and so these fields commute with the VEVs. 
The partitions correspond to the instanton charge k u in each of the N U(l) subgroups. It is then an 
issue as how one can relate the prepotential in the non-commutative theory to that in the commutative 
theory. We have already argued in §XI.1.2 that they must essentially be the same. Our calculations 
at the one- and two-instanton level confirm this hypothesis, up to an interesting re-parameterization of 
the coupling in the theory with Np = 2N. 

Before proceeding with the more convenient formalism (from the point-of-view of performing actual 
computations) based on the linearized instanton partition function constructed in §VI.5, it is useful to 
consider the problem in terms of the intrinsic geometry of 971^. We first note in the M = 2 theory, 
the number of Grassmann collective coordinates (we consider only the case Np = for the moment) 
equals the dimension of 971^; namely, AkN. We have already seen in §IV.2.2 that the Grassmann 
collective coordinates ip A , for each species A, can be thought of as the components of Grassmann- 
valued symplectic tangent vectors to 97tfc. In the Af = 2 case, there is also an isomorphism between 
them and the symplectic basis of one forms 



Here, the one forms h ia manifest the Sp(n) x SU(2) structure of the (co-)tangent space of TWl k and are 
related to the inverse Vielbeins via h ia = h^^dX^ (see Appendix B). Note that the isomorphism (11.32) 




(11.31) 



partitions 
ki,...,k N 



i) lA h 



a = A . 



(11.32) 
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preserves the structure of differential calculus: both Grassmann variables and one forms anti-commute. 
In addition, the Af = 2 supersymmetric volume form on 971^ (5.15) can be re-interpreted in terms of 
differential forms: 

. „ 2kN 2 2kN 2 „ 

/ ^hx, ^ = n-^ / { a a { n n ^ ^ 

J ™k JVRk ;=i a=l J i=lA=l ^ J (11.33) 

= n- 2kN [ T{X, K h ) (a — A) , 

where the function T is expanded in wedge powers of the forms h ia until a top form is obtained which 
can then be integrated . 

Using the isomorphism between the Grassmann collective coordinates and one forms, the instanton 
effective action (5.42) for Af = 2 (so the final curvature coupling is absent) can be written succinctly as 
the inhomogeneous differential form 

S=~d v (V}dX li ). (11.34) 

Here, V and are the (anti-)holomorphic components (defined as in (4.20)) of the (commuting) tri- 
holomorphic vector fields on 971^ denoted V a , a = 1,2, introduced in §V.2.1, associated to the VEVs. 
The other quantity in (11.34) is an equivariant exterior derivative 88 

d v = d-2i v . (11.35) 

One can easily verify that 

d 2 v = 2£ v , (11.36) 

the Lie derivative with respect to V. Hence, dy is nilpotent on SU(iV)-invariant differential forms. The 
centred instanton partition function can then be written in form notation as 

2^ = *~ 2kN [ exp ( - \d v <ytdX*)) , (11.37) 

which means that the terms must be pulled down from the instanton effective action in order to make 
a top form on the centred moduli space 9Jl k (of degree AkN — 4). 

The final form (11.37) is precisely the kind that can localize. To see this consider the more general 
integral 

^(A) = *- 2kN I exp ( - X-'dviV^)) . (11.38) 

We then have 

dZ^(\) =n -2kN x -2 dy ^ dx » exp £ _ X -i dv ( V \ dx »)) } (11.39) 

using the fact that d v is nilpotent on SU(iV)-invariant quantities. But since the volume form is SU(iV) 
invariant the integral — under favourable conditions — vanishes and so 2^ =2) (A) is independent of A. 

88 Hcre, i v implies contraction with the vector V. 
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Hence, it can be evaluated in the limit A — > where the integral is dominated by the saddle-point 
approximation around the critical points of the vector field V a . In the present case, the fixed-point set 
has been identified in §V.2.2 with the configurations where all the instantons have shrunk to zero size. 
The potential problem is that the theory of localization is most easy to apply to situations involving 
compact spaces without boundary. In the case at hand, the instanton moduli space is obviously not 
compact and has conical singularities when instanton shrink to zero sized. As we have already intimated, 
the way to alleviate the problems caused by the singularities is to consider the analogous problem in 
the theory defined on a non-commutative spacetime. As explained in §XI.l, the deformed instanton 
moduli space 971^9 is then a smooth resolution of 971^: instantons can no longer shrink to zero size and 
the corresponding canonical singularities are smoothed over. In this case, as we shall find below, the 
technique of localization can be used to considerably simplify the computation of the centred instanton 
partition function. In particular the critical-point sets become smooth manifolds of a very suggestive 
form. 

The problem with formulating localization in terms of the intrinsic geometry of the instanton moduli 
space is that, due to the presence of the ADHM constraints, we do not have an explicit description of 
the intrinsic geometry. It will prove much more convenient to work in terms of the linearized formalism 
described in §VI.5. It will also prove easier to formulate the theory of localization in terms of Grassmann 
variables instead of differential forms. The expression for the centred instanton partition is (6. 103). 89 

In the language of Grassmann variables, the translation of the covariant derivative is a fermionic 
symmetry Q, or BRST operator, which is nilpotent — at least up to symmetries. The fermionic sym- 
metry that we need to prove the localization properties of the integrals is defined by picking out a 
particular supersymmetry transformation. These latter symmetries, acting on the variables of the lin- 
earized formulation, are given in (10.64b) and (6.85). From the supersymmetry transformations we can 
define corresponding supercharges via 5 = C,aAQ aA - The fermionic symmetry that has the appropriate 
properties is then defined as the combination 90 

Q = 6a A Q° A . (11.40) 

Notice that the definition of Q mixes up spacetime and i?-symmetry indices as is characteristic of 

89 We also perform the re-scaling (8.22) so that there are no factors of g in the instanton effective action (6.96). 

90 Note that a BRST-type operator was first constructed in the context of the Af — 2 instanton calculus in Refs. [199-201] . 
(In particular, the latter reference is most closely related to the approach that we adopt in this paper.) In particular, 
these references emphasize the relation with the topologically twisted version of the original gauge theory. However, 
these references did not go on to use the existence of Q to develop a calculational technique based on localization. A 
nilpotent fermionic symmetry was also constructed in the context of the Af = 4 instanton calculus in Rcf. [176] where 
localization was first proposed as a method to calculate, in this case the Af = 4, instanton partition function. It was 
then shown in [199] that the Q operator in the Af — 2 theory could be obtained by the orbifolding procedure described 
in §X.3.3 in the present article. Some recent papers [202] have also considered the Q-operator and the Af — 2 instanton 
calculus, although they use the equivalent language of differential forms described above. These references then go some 
way toward interpreting Tk as a topological intersection number. 



194 



topological twisting. The action of Q on the variables is 





A 




= -2e AA (^X + 0V 








= -2^ A [a' oAlX ] 


Qx = 


o, 


Qx ] 




Qn = 




QD 


= ^0A,X] , 




Q}C = . 







(11.41a) 
(11.41b) 
(11.41c) 

(11.41d) 
(11.41e) 

It is straightforward to show that Q is nilpotent up to an infinitesimal U(k) x SU(iV) transformation 
generated by x an d <t>- For example, 

Q 2 Wc t = 2i(w 6lX + <P Wa) . (11.42) 
In terms of Q, the instanton effective action (6.96) assumes the form 

S=QE + T, (11.43) 

with 

E = 47r 2 tr fc {i e ^(^ X t + o t/i ^ + i e . A -/d Q[A1 M )X t] + 8 A ^i{a% - ±C (+) • t*j)} (11.44) 

and 

N F 

r = _ vr 2^tr fc ((m / - X )/C / £ / ) . (11.45) 
/=i 

Note that QT = so that the instanton effective action is "Q-closed": QS — 0. Notice in (11.44), to 
anticipate what follows, we have allowed for a non-trivial non-commutativity parameter ( (+) . 

Notice that both the action S and the integration measure u} {M=2 ' Nf) are Q-invariant. The localization 
argument argument proceeds as before. Consider the general integral 

Z ( f =w (\) = [ u,<"= a ^> exp ( - X-'QE - T) . (11.46) 

We then have 

k [ } = A- 2 / W ^»Q Sexp(-r 1 Q--r) , (11.47) 
oX Jwx k L J 

using the fact that Q 2 S = Qr = 0. Since the volume form is Q-invariant the right-hand side of (11.47) 
vanishes. Consequently, Z ( ^ 2 ' Np) (X) is independent of A and can, therefore, be evaluated in the limit 
A — > where the integral is dominated by the critical points of Q S. Since the result is independent of A, 
under favourable circumstances — which will be shown to hold in the present application — the Gaussian 
approximation is exact. 

— # 

Note that the quantity S depends on Q +) and the anti-holomorphic components of the VEV. Hence, 
the derivative of the instanton effective action with respect to either or 0°^ is Q-exact and so the 
instanton partition function cannot depend on these parameters. On the contrary, the holomorphic 
components of the VEVs enter through the action of Q itself, while the hypermultiplet masses enter 
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through T, and so the centred instanton partition function can depend on these parameters. In particu- 
lar, from what we have said above, we have proved that the centred instanton partition function cannot 
depend smoothly on the non-commutativity parameter Of course, there will be a discontinuity 
when = when singularities appear on the instanton moduli space and the localization argument 
breaks down. 



Following the logic of localization we should investigate the critical points of QE. For the moment 
we suppose Q +) = 0. The terms to minimize are, from (10.39), 

\WaXa + 0Vf - [Xa, a' n f . (11.48) 

Notice that this is positive semi-definite and the critical points are simply the zeros. Hence 

WaXa + <PaWa = [Xa, d' n ] = . (11.49) 

Notice that these equations are identical to those in (5.43). Hence we can identify the critical-points of 
QS which the fixed points of the tri-holomorphic vector fields generated by the VEVs. The fixed-point 
set corresponds to the singular subspace Sym fc M 4 where all the instanton have shrunk down to zero size 
and the resulting gauge potential is pure gauge. 



In order to avoid the singular nature of point-like instantons, Ref. [95] suggested a regularization 
based on the smooth resolution of the instanton moduli space first described in purely geometrical 
terms without reference to the gauge theory by Nakajima in Ref. [182]. Of course, as discussed in 
§XI.l, subsequently it was realized that this smooth resolution of the instanton moduli space 
arises naturally when the theory is defined on a non-commutative spacetime [184]. What happens is 
that the existence of the central term in the ADHM constraints (11.1) prevents instantons shrinking to 
zero size. The solutions are associated to the partitions (5.44) and have the form (5.46). The ADHM 
constraints within the u th block are now the ADHM constraints are now those of a non-commutative 
gauge theory with a U(l) gauge group. Taking the trace of the ADHM constraints within the block we 
have 



Yl W iu W uia = k u C(+) ■ (11.50) 
i=k u -i+l 

Now, in contrast to (5.47), in the presence of the central term, tiw must be non-trivial. The critical- 
point set associated to the partition (5.44) is consequently a product of non-commutative U(l) instanton 
moduli spaces: 



L=i x • • • x L=i 



(11.51) 



Recall from the discussion in §XI.l that 9D f l^| JV=1 is a smooth resolution of the symmetric product 
Sym' 



fc TD>4 



In fact in the non-commutative theory the consequences of Derrick's Theorem are avoided and there 
are now exact non-singular solutions to the equations-of-motion even in the presence of VEVs. These 
exact solutions were called "topicons" in [96] since they have a contribution to the action which is 
localized and, unlike an instanton, they have no size modulus. Actually there are N flavours of topicon, 
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associated the N block in (5.44), obtained by embedding the spacetime non-commutative U(l) instanton 
solutions in each of N unbroken abelian factors of the gauge group. So the following picture emerges. 
For instanton charge k, the exact instanton solutions come as a disjoint union of spaces associated to 
the inequivalent partitions (5.44) where each k u corresponds to each of N U(l) subgroups picked out 
by the VEV. Hence, the space of exact solutions, or "moduli space of topicons" , lying within the larger 
instanton moduli space is of the form 



y fajj^ . ( ii.52) 

topicon IK. 

partitions 

fclH hfcjv 



XI. 2.1 One instanton 



We will now begin using localization to evaluate the one-instanton contribution to the prepotential of 
the M = 2 supersymmetric SU(iV) gauge theory with N F hypermultiplets. The instanton effective 
action (6.96) has iV critical points, labelled by v e {1, 2, . . . , N}, at which (5.46) 

Xa = -(<l>l)v, W udl OcS uv . (11.53) 

Note that a' n = in the one-instanton sector. Without-loss-of-generality, we choose our non-commutativity 
parameters 

C) = C? +) = , Cf +) = C> . (11.54) 
In this case the ADHM constraints (11.1) are solved on the critical sub manifold with 

W U a = ^/C^SuJal , (11.55) 

for an arbitrary phase angle 9. The integrals over w V a are then partially annulled by the three 5- 
functions in (3.17) that impose the ADHM constraints, leaving a trivial integral over the phase angle 
6: 

r 3 

/ d 2 w v d 2 w v Y[5(±T™p(wP Wvdl - (5 c3 )) = SttC 1 . (11.56) 

J c=l 

Correspondingly, the (^-functions for the Grassmann ADHM constraints saturate the integrals over 

f 2 

J a=l 

for each A = 1,2. The remaining variables, {w U a, A^}, u ^ v , as well as {/Q/, /C/j}, are all treated 
as Gaussian fluctuations around the critical point. To this order, the instanton effective action (6.96) is 



, n n f 

S = 4vr 2 Cxi + E {Ml u \w U a\ 2 + ^I^Vua) - \ J2(m f + 4§K f Kf [ + •••, (11.58) 

u=l f=l ' 



where (<f>a) U v = (0a)« — (0a)f The integrals are easily done. Note that the integral over Xa yields a 
factor of C -1 which cancels against the factors of ( arising from (11.56) and (11.57) so the final result is, 
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as expected, independent of (. Summing over the N critical-point sets gives the centred one-instanton 
partition function 

N N N F 

- = E { II mr 1 ^ IK'"/ + <&} ■ (11-59) 

Notice that the resulting expression is holomorphic in the VEVs and independent of ( as required. 
The result should be compared with the brute-force calculation of the one-instanton contribution in 
(8.72). The expression is entirely consistent with (11.59) for N F < 2N. 91 We can also compare (11.59) 
with the predictions of Seiberg-Witten theory written down in (8.8a). The expression agree for all for 
N F < 2N. Recall that the Seiberg-Witten predictions as written are only valid for N F < 2N; however, 
our expression (11.59) is simply the obviously extrapolation of (8.8a) to N F = 2N. 

The case with N F = 2N, which corresponds to the finite theory, is rather special. Notice that the 
expression in the non-commutative theory misses the contribution ^j^f- 2 ^' Q f (8.72) which is quadratic 
in the VEVs. We have already identified this term as arising from the singularity of the one-instanton 
moduli space SEJti. However, we have already argued at the end of §XI.1.2 that the mismatch is not 
physically relevant since it can be accommodated by a non-trivial mapping between the couplings of 
the commutative and non- commutative theories. 



XI. 2. 2 Two instantons 



We now turn to the situation for k — 2. There are two kinds critical-point corresponding to two topicons 
of the same, or of different, flavour, respectively. We now evaluate these two contribution separately. 
As in the one-instanton sector we choose the non-commutativity parameters as in (11.54). 

Two topicons of different flavour 

For two topicons of the different flavour, the ADHM constraints are solved with 

w uia = VCe Wi S UVi 6 &1 , a' n = . (11.60) 

The two phase angles $t, % — 1,2, are not genuine moduli since they can be separately rotated by 
transformations in the U(2) auxiliary group. The variables Y n are the genuine moduli representing the 
relative positions of the two topicons. The corresponding solution of the fermionic ADHM constraints 
(4.34) on the critical-point set is 

» A = H A = 0, M «=( P S - p a) > 

where p A are the superpartners of Y n . Notice in this case the critical-point set is non-compact since the 
separation Y n e M 4 . The issue of whether the integral over Y n converges is rather delicate. By explicit 
calculation we shall find that the integral is indeed convergent. 

91 The constant factor S[ Np> for Nf = 2N — 2 and 2N — 1 does not affect the low energy effective action which depends 
only on derivatives of the prepotential with respect to the VEVs. 
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We now proceed to evaluate the contribution to the centred instanton partition function from the 
critical-point set. First we expand around the critical values (11.60) and (11.61). It is convenient to 
use the following notation for the fluctuations 

K = Z A , SM' A = i) (11.62) 



n / \ a 



and to make the shift 



Xa-Xa-[^ U1 (0 O } ) , (11-63) 



"2 



so that Xa = on the critical submanifold. We then integrate over the Lagrange multipliers D and 
which impose the ADHM constraints (2.65) and (4.34). The diagonal components of the constraints (in 
i,j indices) are the ADHM constraints of the two single U(l) instantons. The off-diagonal components 
vanish on the critical-point set and must therefore be expanded to linear order in the fluctuations. For 
the bosonic variables we have 

VCe- Wl (w Ul2 ) 2 + VCe^(w U2l )* 2 + AifjLYmZn = , (11.64a) 
-iVCe- m (w Ul2 ) 2 + iy/(e ie2 (w U2i y 2 + Mf mn Y m Z n = , (11.64b) 
V^Ce-^K^i + ^TCe ld2 {w U2l )\ + Aiff mn Y m Z n = , (11.64c) 

where ff mn = ^tr 2 {r c a m a n ) are 't Hooft's 77-symbols defined in Appendix A. Similarly in the Grassmann 
sector 

y/Ce^fit, + 2(p aA ^i - ° aA Y al ) = , (11.65a) 
VCe-* 1 ^ + 2(p aA Z a2 - a aA Y a2 ) = , (11.65b) 
y/Ce ie ^ A ul + 2(e aA Y al - p aA Z* al ) = , (11.65c) 
+ 2(e aA Y a2 - p aA Z* a2 ) = , (11.65d) 

where Y adl = Y n a a6in , etc. These equations correspond to a set of linear relations between the fluctua- 
tions. It is convenient to define 



K l2 )i = e^(£ + A) , (w U2l )l = e -**(-£ + A) , (11.66) 

so that the fluctuations £ drops out from (11.64c). We can use (11.64a)-(11.64c) to solve for (w Ul2 ) 2 , 
{w U2 i)2 and A, and (11.65a)-(11.65d) to solve for p A U2 , p 2ui , p, A l2 and jl A 2l . We then use the U(2) 
symmetry to fix (i) the fluctuation Z n to be orthogonal to Y n , Z n Y n = 0; and (ii) 9i = 0. The Jacobian 
for the first part of this gauge fixing is 

d u a > - if / d*Yd 3 Zd 3 Z*Y 2 . (11.67) 



VolU(2) J 7T 2 

Now we turn to expanding the instanton effective action (6.96). First the bosonic pieces. To 
Gaussian order around the critical point 

S h = si 1) + Si 2) + --- , (11.68) 



199 



where 

4^ = C((Xa)?l + (Xa)l 2 ) + 8r 2 |(Xa)l 2 | 2 + 2|(0°) U1U2 £ + VliXa)^ + 2(#)U (l + ^Y^Zf 

(11.69) 

and 

2 AT 

^ 2) = E E (#)S->«*I 2 - (ii-7o) 



47T 2 

1=1 11=1 

(^«1,«2) 



In order to simplify the integration over the fluctuations, it is convenient to shift 

— (11-71) 

Vra) ulU2 

and define the orthogonal decomposition 

Xa = xl + Xa, Xa(0X« 2 =O. (H-72) 

After having done this (11.69) becomes 

4^ = C((Xa) 2 n + (Xa)l 2 ) +2C(1 + 4C- 1 K 2 )|(xf)i2| 2 + 8y 2 |(xl)i 2 | 2 
+ 2(« lU2 (le| 2 +(l + 4C-^ 2 )^| 2 ) 



(11.73) 



To Gaussian order the Grassmann parts of the instanton effective action (6.96) are 

S t = £ f (1) + £ f (2) + • • • , (11.74) 

where 

-^A 1] = -5«C* (i + 4C^ 2 R^ + v aA (2r 1 « lla )tz ai y^ + xU/)^ 
+ «7 aA (2r 1 « 1la ) t r ffl4 z^ - xSi^)^ - arVS^^z*^ 

and 

1 2 AT 2 ATjr 

^f = !E E ft/6-A-lED m /+W/*- ( n - 76 ) 

i=l «=i i=l /=1 

By shifting the fluctuations a A and £ A by the appropriate amounts of p A , we can complete the square 
yielding 

ss# > =-*«j t (i+<c- i n»-^ (1177) 

- 2iC-'(l + 4C- 1 i' 2 )-V"' 4 (<,«Z I , 4 Z*"' 9 + 2xLv; i Z- 4 ' s + 2x4 1 Z t , li V"" 3 )p fl .4 . 

Before we proceed, let us remind ourselves that only the variables Y n and p A are facets of the 
critical-point set, the remaining variables are all fluctuations. The contribution to the centred instanton 
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partition function from the critical-point set is then proportional to 

d 4 Y d£ dC d 3 Z d 3 Z* d 8 Xa d 4 p d 4 a d 4 e 

2 r N n f . (11.78) 

x J] J] d 2 w m d 2 w lu d 2 fi m d 2 p tu 11 d]C if dK fi \ Y 2 exp(-^ 1) - §l 2) - - S^) . 

1=1 V u=l f=l J 

The integrals over the Grassmann variables {<^ai £ ti Pa) are saturated by pulling down terms from 
yielding the factors 

«J 4 C 2 (i + 4C- 1 r 2 ) 2 (4r 2 ( x t 1 z - x \ 2 zr + (<P° u \ u f(z 2 z* 2 - (z ■ z*) 2 )) ■ (n.ra) 

The integrals over the remaining Grassmann variables {/x^, plf u , /Q/ , ICfi} , u ^ Ui,u 2 , are saturated by 

(2) 

pulling down terms from /5 f giving rise to 

2 N Np 

n n (<iu 2 i[( m f+M( m f+M ■ ( n - 8 °) 

i=i «=i f=i 

The {Z,^,Xa} integrals are 
J didCd 3 Zd 3 Z*d\ a ( K AY 2 { X \ 1 Z- X \ 2 Z*) 2 + 2{^ , (11.81) 

which yields the non-trivial factor 

(0°t ) 2 

while those over -uw, u ^ ui,u 2 , give a factor 

2 N j 

n n T^-- (n- 83 ) 

i=l u=l K^raJuui 

Finally all that remains is to integrate over the relative position of the instantons. As promised the 
integral is convergent: 

c 2 



^KTIFv = 96- (1L84) 

Putting all the pieces together with the correct numerical factors gives the final contribution of the 
critical-point set to the centred instanton partition function 

9 2 TV N F 

n n t^iik+ok+o- ( n - 85 ) 

Vruiui) 1— 1 u= i \'ruu i J j =1 

(^«1,»2) 

Notice that the result is holomorphic in the VEVs as required. Summing over the |iV(iV — 1) critical- 
points for topicons of different flavours, we have the following contribution 

2. (00 _ O)2 > i 11 - 86 ) 

u,v=l "vJ 
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where we have written the answer in terms of the functions 

TV , N F 



V = l \ TV) 
(*«) 

defined in [90]. 

Two topicons of the same flavour 

There are N critical-points of this type, describing two topicons of the same flavour: u\ = U2 = v G 
{1, . . . ,N}. On the critical submanifold {w V ia,a' n } and {/x^, jif v , M.'^ } satisfy the ADHM constraints, 
(11.1) and (4.34), respectively, of two instantons in a non-commutative U(l) theory. The remaining 
variables all vanish and are treated as fluctuations around the critical-point set. 

As previously, it is convenient to shift the auxiliary variable Xa by its critical-point value: 

We now expand in the fluctuations {w U ia, p^ii VhvSi f° r u 7^ v -> as wen as {^if^fi}- Since all the 
components of the ADHM constraints are non-trivial at leading order the fluctuations decouple from 
the (^-functions in (3.17) and (5.53) which impose the constraints. The fluctuation integrals only involve 
the integrand exp —S, where S is expanded to Gaussian order around the critical-point set. However, 
it is important, as we shall see below, to leave Xa arbitrary rather than set it to its critical-point value; 
namely, Xa = 0, after the shift (11.88). The fluctuation integrals produce the non-trivial factor 

N j Np 

II 77— 7777 772 II det 2 ((m / + V2>°)l [2]x[2] - X ) = S v (<fi - XJSM ~ A a ) • (11-89) 

«=i (det 2 (x + 0Ll[2]x[ 2 ])j /= i 

Here, Aj, % — 1, 2, are the eigenvalues of the 2x2 matrix x and S u (x) was defined in (11.87). 

The remaining integrals involve the supersymmetric volume integral on VJt 2 ^\ N _ 1 , into which we 
insert the integrand (11.89) which depends non-trivially on X . Now by itself (c) i u> (-V-=2) = 0. This 

^2 I N=l 

is clear from the linearized form of the instanton effective action (6.96) with N — 1 and Np = 0: 
integrals over the Grassmann collective coordinates pull down two elements of the matrix X ^ from the 
action and since there are no compensating factors of X the resulting integrals over the phases of the 
elements of x will integrate to zero. This is why we left X arbitrary in (11.89) since after expanding in 
powers of the eigenvalues A« it is potentially the quadratic terms that will give a non-zero result when 
inserted into J-~( f )i u; (Ar=2) . To quadratic order (11.89) is 

^ I JV=1 

So the contribution from this critical-point set is of the form 

JA(0J ^F + :J2 ^ wrl> (1L91) 
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where the VEV-independent constants 3 lj2 are given by the following integrals 

\ = \ f (A 2 + A 2 ) ee f ^ (I(tr 2X ) 2 - det 2X ) , 

1 \N=1 1 \N=1 



(11.92) 

% = I , w<"= 2) AiA 2 = / , ^ 2) det 2X . 



I N=l 



im?>\ Jim 

~ Ijv=i ~ 

We remark that (11.91) is holomorphic in the VEVs as required. 

The moduli space ^P\ N=1 is the Eguchi-Hanson manifold [186], a well-known four- dimensional 
hyper-Kahler space [203]. So after all the Grassmann variables and Xa have been integrated out, we 
can write 3 lj2 as integrals over the Eguchi-Hanson space of a suitable integrand. The following results 
were proved in the Appendix of [95]: 

3i = \, h = 0. (11.93) 

Hence the final result for the contributions from the N critical points of this type to the centred instanton 
partition function is 

!5><«^- <»■«> 

u=l 

Finally, summing (11.94) and (11.86) we have the centred two-instanton partition function 

^ = Z2 = h «-^) 2 + ~ t h M ' ) ^Kf ' (1L95) 

For N < 2N F it is astonishing to find exact agreement with the prediction from Seiberg-Witten theory 
(8.8b). As in the one-instanton sector, our expression (11.95) with N F = 2N is simply the extrapolation 
of the formula (8.8b). We can also compare (11.95) with the explicit brute-force integration over the 
instanton moduli space for gauge group SU(2) that we reviewed in §VIII.3.1. The agreement is exact 
for N F < 4. As in the one-instanton sector the case with N F = 4 is rather special since we expect 
the non-trivial matching of the coupling constants of the commutative and non-commutative theories 
displayed in (11.27). At the two-instanton level this matching implies 

N 

J= 2 = + 2m Cl ^ c + tnc 2 ^(0°) 2 , (11.96) 

u=l 

modulo constants. Since we have already determined c\ in (11.28) this matching condition is non-trivial 
even for gauge group SU(2) since the coefficients depend non-trivially on the masses and the VEVs. 
Using (11.59), (11.95) (for N = 2) and (8.45), all with N F = 4, one finds complete consistency with 

1 ' 7 (11.97) 



c 2 



- 



su(2) 2 3 ?ri V 2 4 3 5 2 4 / ' 



The question is whether the localization technique can be extended beyond two instantons? In fact 
it already has in the context of the Af = 4 theory softly broken to Af = 2 by mass terms as described 
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in Ref. [96]. In this reference the leading-order terms in the mass expansion of the prepotential were 
calculated to all orders in the instanton charge and were shown to agree with the predictions of Seiberg- 
Witten theory. It is clear that this subject is moving quickly and it seems possible that one might be 
able to provide a completely self-contained instanton proof of Seiberg-Witten theory. 
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Appendix A: Spinors in Diverse Dimensions 



In this appendix we define our conventions for spinors in various dimensions. The treatment will be 
be geared toward the applications needed in the main text. 

The Minkowski space metric will be chosen to be t]mn = diag(— 1, 1, . . . , 1). The D-dimensional 
Minkowski space Clifford algebra is 

{r M ,rV} = 2r] MN , (A.l) 
with M, N = 0, . . . , D — 1. The D-dimensional Euclidean space Clifford algebra is 

{Tm,T n } = 15 mn , (A.2) 

with M, N = 1 . . . , D. 

We now define representations of the Clifford algebra in several (even) dimensions. In all our 
representations the additional element of the Clifford algebra is 

r D+ i= (\ °i) , (A.3) 



so that Weyl spinors are of the form 



M and . (A.4) 



For D = 2, we take 
in Euclidean space and 



J \ip 



< _! o) - r '=(?;j 



in Minkowski space. Weyl spinors are complex and real, respectively. 
In four-dimensional Euclidean space we take 

■*•=(». 7") ' ™= !> 2 > 3 > 4 > (A.7) 

where a n = (if, 1) and a n = (—if, 1) are the Euclidean space a-matrices. 92 We will also use the 
quantities 

- - K- \ ^ ' 

Importantly, a mn is self-dual while a mn is anti-self-dual: 

@mn ^mnkl^kl j &mn ^mnkl^kl ■ (■^•9) 

92 Here, f are the usual Pauli matrices. 
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They can be expressed as 

0~mn = \i r ]mn fC > a mn = ^Vmn^ J (A. 10) 

in terms of 't Hooft's eta symbols [2] rj^ and ff mn , a = 1, 2, 3, defined below in (A. 18). 



n = 0,l,2,3, (A.ll) 
(— l,r) and <r n = (— 1, — r) to agree with the 

In both Euclidean and Minkowski space, we follow the usual convention of writing two-component 
Weyl spinors as ip a: a — 1, 2, and tp a , a = 1, 2. A Dirac spinor is 

*=($)■ < A - 12 > 

In Euclidean space the Weyl spinors are pseudo real and so ip a and i/j a are independent quantities. In 
Minkowski space the Weyl spinors are complex and one can define a four-component Majorana spinor 
of the form (A. 12) with 

^ = (V«) t , a = a. (A.13) 

In this notation, the a-matrices have indices o n aa and a® a . In both Minkowski and Euclidean space, 
the indices are raised and lowered with the anti-symmetric tensor e: 

e 21 = e 12 = l, e 12 = e 21 = -l, e n = e 22 = . (A.14) 

In addition, we use the summation conventions: 

n = r<Pa, M = M & . (A. 15) 



In Minkowski space, we take 

In = 



a r , 

-Or,. 



where we take the Minkowski space a-matrices o n 
notation of Wess and Bagger [47] . 



In six-dimensional Euclidean space we take 



r. = [l E ») , (A.i6) 

where the E-matrices are defined as 

— / 00 11\ \ / 

S a = ( - V , iV , -V , if l , -V , if} ) , 
and in their turn, the 4 x 4-dimensional matrices rf and rf, c = 1, 2, 3, are 't Hooft's eta symbols [2]: 

Vab = V c ab = £cAb A, B e {1,2,3}, 

vIa = V c A4 = $cA, (A. 18) 

Vab = ~Vbai Vab = ~Vba- 
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In six-dimensional Minkowski space we can take (A. 16) with 



£„ = (ir) 3 ,ir) 3 ,r) 2 ,ir) 2 ,r) 1 ,ir) 1 ) , 

Sa = ( - iv 3 , iff, -V 2 , iff, -V 1 ,^ 1 ) , 



(A.19) 



with a = 0,1,2,3,4,5. 

The following properties of the E-matrices are valid both in Euclidean and Minkowski space: 



A positive (negative) chirality Weyl spinor is written ip A (i[>a), A = 1,2,3,4. For consistency the 
S-matrices have indices and £ a AB- In Euclidean space the Weyl spinors are complex while in 
Minkowski space they are pseudo-real. In the latter case when there are an even number of such 
spinors we can introduce the concept of a "symplectic-real spinor" ipf, i = 1, . . . , 2p, satisfying the 
pseudo-reality condition 



where Qij is a (symplectic) matrix with QQ* = — lp^xpp]- 

We can easily build up a representation of Clifford algebras in higher dimensions from those in lower 
dimensions. Suppose, we have Clifford algebras in p and q dimensions with Euclidean space generators 
Tn \ n = 1, . . . ,p, and a — 1, ... ,q. A representation of the Clifford algebra in p + q dimensions 
can then be constructed by taking the products 



^aAB^aCD — ^aAB^aCD — IeaBCD 
^aAB^aCD = —^AC^BD + 25aD$BC ■ 



(A.20) 



(A.21) 





(A.22) 
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Appendix B: Complex Geometry and the Quotient Construction 



Our intention in this appendix is to introduce some basic properties of complex manifolds 93 and 
describe the "quotient construction" . 

A complex manifold 9JT admits a complex structure I, a linear map of the tangent space to itself, 
satisfying I 2 = —1, which is integrable. This latter property means that the torsion, or Neijenhuis 
tensor, vanishes, so for any two vectors X and Y 

[IX, IY] - [X, Y] - I[X, IY] - I[IX, Y] = . (B.l) 

Necessarily 9JT must be of even dimension, 2n. A Hermitian metric g on 9JI is invariant under i", so 

g(IX,IY) = g(X,Y) , (B.2) 

for any two tangent vectors X and Y. The fundamental 2-form uj is defined by 

uj(X,Y)=g(IX,Y) . (B.3) 

(Notice that uj is anti-symmetric as a consequence of (B.2) and I 2 = —1.) We will denote (real) local 
coordinates on 9JI by x^, fi = 1, ... , 2n; however, a complex manifold always admits local holomorphic 
coordinates (z 1 , z 1 ), i = 1, . . . , n, for which 

In this basis, the Hermitian metric and fundamental 2-form are 

g = gijdz 1 dz ] , uj = ig-^dz 1 A dz 3 . (B.5) 



A Kahler manifold is one for which the fundamental 2-form is closed, in which case the latter is 
called the Kahler form. The closure of the Kahler form implies (and follows from) the complex structure 
i" is covariantly constant 

V M J = , (B.6) 

with respect to the Levi-Civita connection associated to g. In addition, in holomorphic coordinates, 
the metric is given in terms of a Kahler potential K by 

d d 

Another way to characterize a Kahler manifold is via its holonomy group. Generically, on a manifold of 
dimension 2n this would be 0(2n); however, since parallel transport with the connection respects 
the holomorphic structure (does not mix holomorphic and anti-holomorphic indices) the holonomy of a 
Kahler manifold is contained in the subgroup U(n) C 0(2n). 

93 We will use the term "manifold" loosely to encompass spaces which are generically smooth but may have certain 
singularities. 



208 



We now proceed to define a hyper-Kahler manifold. Such a space 9Jt admits three independent 
complex structures I^ c \ c = 1,2,3, (which we often represent as a three- vector J), satisfying the 
algebra 

j(c)j(d) = _ S cd + e cdej{e) ? ( Rg ) 

which are covariantly constant with respect to the Levi-Civita connection of a metric g which is Hermi- 
tian (B.2) with respect to each of the I. There are three Kahler forms Co each related to the metric as in 
(B.3). The space 9JI is necessarily 4n dimensional. As before, we will take fx = 1, . . . , 4n, to be a set 
of real local coordinates for 9JI. There are holomorphic coordinates with respect to each of the complex 
structures, but generally there are no local coordinates which simultaneously represent the action of all 
the J; consequently we will have to work in terms of non-coordinate bases. The tangent space T9JI of a 
hyper-Kahler manifold admits the following Sp(ra) x SU(2) description. There is a vielbein hia^ and an 
inverse h ia ^. The composite index id, i = 1, . . . , 2n and a — 1, 2, manifests the fact that the vielbein 
picks out an Sp(n) x SU(2) structure in T9J1. 94 The metric takes the form 

g = h^ ll h^ u n^dx^dx v . (B.9) 

Here, f^, i = 1, . . . , 2n, is an Sp(ra) 2-form. The index a — 1, 2 is an SU(2) spinor index which can be 
raised and lowered in the usual way by the e tensor associated to SU(2) spinors (see Appendix A). The 
complex structures act on the SU(2) indices of the vielbeins in a simple way: 

(1^ ■ = -in^phVp , (B.10) 

where r c are the three Pauli matrices. Notice that the three complex structures can be rotated as a 
three- vector under action of the SU(2), although this action is not generally an isometry of Wl. The 
three Kahler forms corresponding to the three complex structures are 

Q = in^T^jh^^Kdx" A dx u . (B.ll) 



On a hyper-Kahler manifold, since the tangent space admits an Sp(n) x SU(2) structure, one can 
define the notion of symplectic tangent vectors. These are quantities, like A4\ which carry Sp(n) indices 
only and which have a symplectic inner product 

Sl(M,M) = £kMW. (B.12) 

For a hyper-Kahler manifold the SU(2) part of the spin connection (not to be confused with the 

Kahler forms) vanishes. This means 

The tensor fiy is covariantly constant with respect to the Sp(n) part of the connection. Similarly, the 
only non-vanishing components of the Riemann tensor are 

^iaj/3k7l5 = R iW € al3 e jS ■ ( B - 14 ) 

This is a symptom of the fact that the holonomy group of a hyper-Kahler manifold is restricted to 
Sp(n) C U(2n) C 0(4n). The Sp(n) curvature Ri^i is a totally symmetric tensor in all its indices. 

94 The group Sp(n), which has rank n, is often denoted as USp(2n). 
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After the brief introduction to complex geometry and, in particular, hyper-Kahler spaces, let us now 
turn to a way of constructing such spaces known as the quotient construction. We begin, initially, with 
the simpler Kahler case. The quotient construction is a way of constructing a Kahler manifold 97t in 
terms of some larger dimensional Kahler manifold, the "mother" space, 971 which admits some group of 
isometries G which preserves both the metric and complex structure. We will be interested in the case 
when G is some compact Lie group and we will denote the Hermitian generators T r , r = 1, . . . , dim G. 95 
Each generator T r of G defines a vector field X r over 97T in the usual way. Since the group action 
preserves the metric and complex structure the Lie derivatives Cx r I = £x r 9 — 0. This implies that X r 
is a holomorphic Killing vector. The two conditions imply a third; namely, C-x r ^ = 0. Since 

£x r & = i Xr d£b + d(i Xr oj) = d(i Xr oj) , (B.15) 

there exits a Hamiltonian function fi Xr where d/i Xr = ix r &- m In fact this only defines [i Xr up to 
a constant which can be fixed, up to the abelian factors in G, by requiring that ji x is equivariant: 
Xfx Y = ij^ x,y \ In this case, the Hamiltonians are known as moment maps. As T r varies in the Lie 
algebra of G we can define the function \x from 971 to the Lie algebra 

dim G 

fi=Y,^ Xr T r . (B.16) 

r=l 



The Kahler quotient is then 

Tl = fi-\0)/G , (B.17) 

in other words the quotient of the subspace 91 C 971 on which the moment map vanishes, the so-called 
level set, by the group G. The daughter 971 is of dimension 

dim 971 = dim 971 -2 dim C7 (B.18) 

and inherits a complex structure, metric and connection from 971. In order for the quotient to be well 
defined, we require that G, at least generically, acts freely on 971. There may be points at which the 
action fails to be free, in which case there will be orbifold-type singularities in the quotient. Since, 

g(IX r ,X s ) = u(X r ,X s ) = ~{X r ,dn x ') = -^ x '\ = , (B.19) 

using the equivariance property, a basis for the vectors in the tangent space normal to the level set is 
given by IX r , r — 1, . . . , dim G. 

Before we discuss the induced metric, complex structure and connection, let us say a little more 
about the G-quotient part of the construction. It is useful to think of 91 as a principal G-bundle over 
the quotient 97T = 9t/G. Hence there is a projection p : 9T — > 971. The G-action picks out a subspace 
of V C T9T called the vertical space spanned by the vector fields X r . The tangent space T971 is then 
identified with the quotient vector space T9T/V. Let the horizontal space, IK C T9T be the subspace of 
vectors orthogonal to V, i.e. g(X,X r ) = for all r, so that 

T9T = K©V. (B.20) 

95 We will take an inner product on the Lie algebra of G with Tr T r T s = S rs in some matrix representation. 
96 This reasoning assumes that the first cohomology group R) is trivial. 
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A tangent vector X E TWl has a unique horizontal lift to "K (which by a slight abuse of notation we 
denote with the same letter). From these definition it follows that a vector X e "K C TWl satisfies 

g(X,IX r )=g(X,X r ) = . (B.21) 



The complex structure I in the mother space naturally induces one in the daughter. To see this it is 
enough to notice from (B.21) that i" preserves "K. Consequently, the lift of J is simply equal to J acting 
on "K and the integrability condition (B.l) is automatically satisfied. The induced Riemannian metric 
on Wl is obtained in a analogous way. For two vectors X, Y e TWl (with lifts to % denoted by the same 
letter), g{X,Y) = g(X,Y). Finally a Levi-Civita connection VonOT induces such a connection V on 
the daughter in the following way. First of all, the projection of the connection V to the tangent space 
T91 C TWl defines a connection on 91. It is easy to prove that the connection on 91 is of Levi-Civita 
type. This is then identified with the pull-back of the Levi-Civita connection V on Wl to 91, under the 
projection p. 

Of particular importance for the instanton calculus is the volume form induced on the daughter 
space (in the hyper-Kahler case discussed below). It will be convenient for applications to the instanton 
calculus to define a volume form which, using physicists' language, is not gauge fixed. This means a 
G- invariant volume form uj on the level set 91 divided by the volume of the G-orbit at that point. A 
bone-fide volume form on Wl could then be obtained by a gauge fixing procedure, but we shall not 
necessarily need to perform this operation. From what we have said above, it is straightforward to see 
that the volume form we are after is 

/ «, ^ / * -fff- J] ^) ■ (B-22) 
Jm Jm Vol G (x) £A 

Here, Gj is the canonical volume form on Wl, Vo\g(x) is the volume of the G-orbit through a point 
x G Wl and J(x) is a Jacobian factor that arises when the integral over Wl is restricted to 91 by the 
explicit 5-functions. Geometrically, J(x) is the square root of the determinant of the matrix of inner 
products of the normal vectors to the level set. Since a basis of such vectors is provided by IX r , the 
Jacobian is simply 

J(x) = |det£| 1/2 , (B.23) 
where L is the dim G x dim G matrix of inner products with elements 

L rs = ~g{ IX r , IX S ) = g(X r , X s ) , (B.24) 

Since G generically acts freely on Wl, each orbit is, up to a scale factor, a copy of G itself. Hence, the 
volume of the orbit through a point on the level set is 

Vo\ G (x) = |det X| 1/2 VolG , (B.25) 

Here, VolG a constant, the volume of the group in some canonical normalization. Notice that the 
factors of the determinant of L cancel to leave 

/. W = W*°n^)- (B.26) 
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The hyper-Kahler quotient construction is an obvious generalization of the Kahler quotient con- 
struction described above. One starts with a hyper-Kahler space 971 admitting a group action G which 
preserves the metric and three complex structures. The isometries correspond to vector fields which 
are tri-holomorphic Killing vectors, that is (i) holomorphic with respect to each of the three complex 
structures Cx&^ = and (ii) preserving the metric Cxg = 0. Associated to each complex structure 
and Kahler form there is a moment map, defined as in the Kahler case above, which we can assembled 
into the triplet jl. In an identical way to the Kahler case, one can prove that the quotient 

Wl = fl-\0)/G (B.27) 

is a hyper-Kahler space of dimension dim 971 — 4 dim G. In the hyper-Kahler case, a basis of vectors 

~ ( c ) 

normal to the level set is provided by the 3dimG vectors J X r . As in the, Kahler case a natural 
metric g(X, Y) is induced on 971 by taking g(X, Y), with X and Y (denoted by the same symbol) being 
the unique lifts of X and Y to "K. The local condition for a vector X e T971 to be in "K is 

g(X,I (c) X r )=g(X,X r )=0, (B.28) 

generalizing (B.21). In the SU(2) x Sp(n) basis this condition is X^ttj-e^XpT^ = X^-e^xf = 0, 
or equivalently 

X^n Vj X^ = . (B.29) 

The fact that the SU(2) indices of the tangent vector are un-summed, allows us to identify symplectic 
tangent vectors to the quotient space. Their lifts are simply symplectic tangent vectors of 97T subject 
to the projection (B.29): 

MV- jX/ = . (B.30) 

Of particular importance to the instanton calculus is the volume form on the quotient space, gener- 
alizing the expression (B.26) in the Kahler case. The Jacobian factor is, as before, the determinant of 

~ ( c ) 

the matrix of inner products of the basis vectors i" X r normal to the level set. We have, 

g(I (c) X r , I id) X s ) = 5 cd g(X r , X.) - e^g(X r , J (e) X s ) = Xs) C = d > (B.31) 



d , 



where we have used (B.19). This means that 



J(x) = |detL| 3/2 , (B.32) 

where the matrix L is defined in (B.24). As previously the volume of the gauge orbit through a point 
of the level set is (B.25). Hence the G-invariant volume form is 

„ TP dimG 3 

/ « = — / u,\detL\ J] n<% (c)X • (B.33) 
J m VoiG Jot ^ ^ 

Note the factor of |det L\ as compared with the case of the Kahler quotient (B.26). 

As an example of hyper-Kahler quotient construction which is directly relevant to the ADHM con- 
struction, let us start from flat Euclidean space 97T = M. An . In this case there are local coordinates z ia 
for which the one forms h ia = dz ia . The symplectic matrix is simply 

Cl = ( 1 ° 1[n ' xW ) . (B.34) 
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and the flat metric is 

g = 47v 2 n..e^dz la dz^ . (B.35) 
In this case, there exists a hyper-Kahler potential 

X = tk-e^z^ . (B.36) 



We now consider the hyper-Kahler quotient of R by some compact group G of tri-holomorphic 
isometries. The most general isometries of this type are generated by the vector fields 

X r = IT'.: 1 '— , (B.37) 

where T r generators in some 2n-dimensional representation of G which satisfies 

{QT r Y = QT r . (B.38) 

The moment maps are 

jpr = - i J« (QT%z^ e^T* $ - C , (B.39) 
where ^£ r T r , are arbitrary constant elements in the abelian component of the Lie algebra of G. 

The (gauge un-fixed) volume form on the quotient follows from the general formula (B.33): 

L u= voW 1 n i detL i n u^^^** 7 **-?^) » (R4o) 

where, as previously, 

L rs = g(X r , X s ) = -^e-pT^Tf/P . (B.41) 



For this example, we now discuss more concretely the connection on the quotient and find an 
explicit expression for the Riemann tensor. First the connection. As we have explained previously, the 
prescription is simple: lift tangent vectors to Wl to 3i (denoted by the same symbol). The Levi-Civita 
connection is then obtained by projection to "K from that on 971. Locally, the tangent space of the 
horizontal subspace consists of vectors X that are orthogonal to the 4 dimG vectors {X r , I X r ], the 
conditions (B.28); hence, for two vectors X, Y G T9JT(~ %) 

V X Y = (V X Y) 11 = V X Y - {VxY) 1 - , (B.42) 

where 

3 

{VxY) 1 = 52L£{g(V x Y,X.)X r + £ g(S7 x Y, I (c) X s )I (c) X r ) . (B.43) 

rs c=l 

We can make the inner product more explicit by using (V ' X Y)™ = X^dY^/dz^ and the expression 
for X r in (B.37) 



~g(V x Y,X r ) = -iX l "(QT r )rrY^e^. , 

~g(v x Y,i ic) x r ) = -x^iCrr^e^r^ 



~ r (0 (B.44) 
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In deriving these expressions we used the facts g(Y,X r ) = g(Y, I^X r ) = and, from (B.38), QT r is a 
symmetric matrix. 

The Riemann tensor of 9JT can be expressed using the standard formula in terms of the lifts 
W, X,Y,Z G %: 

R(X,Y,W,Z) =g(W,V x (V Y Z)" - Vy(V^) 11 - V [x , y] „Z) 
= ~g{W,W x iy Y Zf - V Y {V X Z)W - V Z {V X Y)W + V z (V y X)" - [Z,[X,Y]]) . 

Notice that some of the projections are unnecessary here, since W G "K and [Z, K] G V for any K G V. 
Using the fact that g(W, V x (V y Z) ± ) = -<?(V X W, (V y Z) ± ) we can write (B.45) as 

i?(x, y, w, z) =ii(x, y, ^ z) +ff((v x z) 1 , (Vy^) 1 ) - ^((W) 1 , (wz)- 1 ) 4 

-^((v x y)- L ,(v w z)- L )+si((VyX)- L ,(v w z)- L ) , 

where i? is the Riemann tensor of 9JT. In the example relevant to the ADHM construction this vanishes 
since the mother space is flat. 

We now extract the components of the Riemann tensor in the z ia basis by choosing X = d/dz ia , 
Y = d/dz^, W = d/dz^ and Z = d/dz U . One finds 

= E {(nnij^^ki + (nr-^^jk + (^^(^ji} • (R47) 

rs 

This forms reflects the decomposition of the Riemann tensor in (B.14) and so we can extract the 
symplectic curvature of the quotient: 

= 2 E {(n^WC^ki + (nr-^^jk + (nni^™ • (b.48) 

rs 

Since, from Eq. (B.38), f2T r is a symmetric matrix, it is apparent that the symplectic curvature is 
totally symmetric in all its indices. 
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Appendix C: ADHM Algebra 



In this appendix, we collect together most the pieces of ADHM algebra that we need in the text. 
Performing ADHM algebra is rather an art, requiring a good deal of chicanery and experience. The 
ADHM constraints and identities (2.50), (2.55), (2.61), (4.30) and (4.31b): 

A«A / j = <JV" 1 ' A"b a = b a A & , b a bP = 6jl lk]x[k] 
A A M + M A Q = , Mb a = b a M 

are used pervasively. The following differentiation formulae which are proved using the definitions (2.50) 
and (2.51), are particularly useful: 

{ — f ct &a b A ■ f 
_% h : a , (C2a) 

□/ = -Afb a Vb a f , (C.2b) 
d n V = -A^fa^KV - Vb a a naA fA" , (C.2c) 
UP = -4{V, b a fb a } + 4A d /fo Q P6 Q /A d . (C.2d) 

To complete this compendium of differentiation formulae, using the expression for the gauge potential 
(2.49) and the ADHM identity (2.47) along with (C.l), for any J(x), we have 

V n (UJU) = dniUJU) + \A n , UJU] 

= U d n J U - Ub a a na( ,fA«JU - UJA & fa™b a U , (C.3a) 
V 2 (UJU) = -AU{b a fb a ,J}U + AUb a fA & JA A fb a U 

+ Ud 2 JU - 2Ub a fa naA A & d n JU - 2Ud n JA A af l a fb a U . (C.3b) 



Finally, there are various other tricks that we will explain in situ; however, there are particularly 
useful ones involving quantities of the form 

j_ B[ N ] X[N ] 0[jv] x [ 2 fc] 

\ [2k] X [N] C[fc] x [fe] 1 [2] x [2] 

The pair of identities is 

A«JU = (a & J - Ca & )U , UJA & = U{ja & - a & C) . (C.5) 

The first of these is proved by expanding A = a + xb, as in (2.45). Then with b assuming the canonical 
form (2.57) we can commute xb it through J , picking out the component C, and then use to (2.47) to 
re-write xbU = —all. The other identity is proved in a similar way. 

Note that where possible we suppress indices. However, in most situations the spinor indices a and 
a need to be written explicitly because they are often not contracted in an obvious way. 

Osborn's formula 



(C.4) 
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This identity reads [8] 

ti N F^ n = -g- 2 D 2 ti k logf. (C.6) 

We start by expanding out the left-hand side using the ADHM form for the field strength (2.52): 
ti N F 2 mn = -leg-hikiKVti 3 fb a Vb a f + b a Vb a fbpVbP f) . (C.7) 

Now the right-hand side. Firstly, 

□ tr fc log / = tr fc (f-'Df -f-'d n ff- 1 d n f) 
= -2tr k (b a Vb a f + 2f) , 

using the differentiation formulae (C.2a) and (C.2b) along with the expression for V in (2.51). Then 
employing the differentiation formulae (C.2a)-(C.2d) once again 

□ 2 tr* log / = 8tr fe (b a {V, b^fb p }b a f - b a A & fb p VbPfA & b a f 

- b a A & fbpPb a fA & bPf - b a VbPfA & b a fbpA & f (C.9) 
+ b + aVb a fb^f + 2fb a Vb a f) . 

Then using the ADHM identity (2.55), the definition of V in (2.51) along with (2.61), this becomes 

□ 2 tr fc log / = 16tr k (b a VbPfV*Vb a f + b a Vb a fbpPbP f) , (CIO) 
precisely — g 2 times (C.7). 

Zero modes 

We prove that the quantity 

A Q (C) = UCfbJJ - Ub a fCU (C.ll) 

satisfies the zero mode condition 

P dQ A Q (C) = . (C.12) 

From Eqs. (C.ll), (C.2a) and (C.3a) we calculate: 

D« a A a {C) = 2Ub a f(A"C + CA«)fb a U . (C.13) 
Hence the condition for a zero mode is 

A A C + CA« = . (C.14) 

Expanding A(x) as in (2.45), we have 



CM 3 = bfC X3 . 



Derivative of the gauge field by a collective coordinate 



(C.15) 
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We now prove that 

g ^L = -vj-^u) + uJ^fa n bU - Uba n fJ^U . (C.16) 



Firstly, using UU = 1, 

Next we insert 1 = UU + A/ A into the middle of the 1 th and 3 th terms, and then use the fact that 
A n = g~ x Ud n \J to arrive at 

9^ = -Dj-f-u) + ^AfAd n U-(d n U)AfA^L . (C.18) 
y dX» \dX^ ) dX* <9X^ v ; 

From the ADHM identity (2.47), we have 

Ad n U = -(d n A)U = -a n bU , (d n U)A = -Ud n A = -Uba n , 

rrdU dA TT da TT 8U , -OA -da (C19) 

A dx = 'ox 11 = ~dx u ■ = ""ax = • 

from which (C.16) follows. 

Corrigan's inner product formula 
The expression to be proved reads: 



/ 



7T 2 



Here 



d 4 xtv N A(C)A(C) = -—tr k [CiVoc + l)C'-C'(Voc + 1)C] . (C.20) 



^ = Hm p = 1 _ 66 = ( JWp*i ) , (C.21) 



as per Eqs. (2.51) and (2.105). 

The strategy of the proof is to show that the integrand is actually a total derivative, 

t rjv A(C)A(C") = f □ tr fc [C(V + 1)C"/ - C"(7> + l)Cf] , (C.22) 

after which Eq. (C.20) follows from Gauss' Theorem, together with the asymptotic formulae of §11.4.3. 
To verify this, let us first write out the left-hand side of Eq. (C.22): 97 

tr [ [C'VC - CVC')fb a Vb a f - CVb a fC'Vb a f - VCfb a VC'fb a ] . (C.23) 



97 Here, and in the following, the trace on the right-hand side is either over instanton or ADHM indices, depending on 
the context. 
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We have used the cyclicity of the trace, together with the definition (2.51) for the projector V. Turning 
to the right-hand side of Eq. (C.22), one calculates: 

|Dtr[(7(P + l)C7] = Jtr[ - 2C{V,b a fb a }C'f + 2CA & fb a Vb a fA«C'f 

- 2C(V + l)C'fb a Vb a f + CAJa n&a b a VC'd n f 
-CVb a o na6l fA«C'd n f] 

= \\x[Cfb a Vb a C\V - 1) - C{V + l)C'fb a Vb a f 
+ Cfb a VC'fb a V + CVb a fC'Vb a f] 

= it rjv A(C7)A(C") - \tr[{CC + C'C)fb a Vb a f] . 

Here the expressions on the right-hand sides follow from the differentiation formulae (C.2a)-(C.2d). 
We have also invoked the relations (4.31a), (4.31b) and (2.51) and, once again, cyclicity under the 
trace. From the final rewrite in Eq. (C.24), the desired result (C.22) follows by inspection upon anti- 
symmetrization in C and C . 

Covariant Laplace equation with bi-fermion source 

We now prove that for an adjoint- valued scalar field the solution of the covariant 
with a bi-fermion source: 

V 2 </) = A(C)A(C") 
and boundary condition lim^oo <f>(x) = 0°, is 

<j> = -\ucfc'u + u( ct) ° )u, 

where 

is a k x k matrix. 



Laplace equation 
(C.25) 

(C.26) 
(C.27) 



First of all, using the asymptotic formulae (2.105) it is easy to see that (C.26) has the correct 
boundary condition at infinity. Next we employ the differentiation formula (C.3b) with J = jCfC' 
and compare with 

A(C)A(C") = -UCfb a VC'fb a U + UCfb a Vb a fC'U 

+ Ub a fCVC'fb a U - Ub a fCVb a fC'U . ^ C ' 28 ^ 

Using the differentiation formula (C.2b) we find that the 3rd term in (C.3b) matches the 2nd term in 
(C.28). Then writing V = 1 - A^/A" in the 1st and 4th terms of (C.28) and using Eqs. (C.14) and 
(C.15), Aq,C = —CAa and b a C = Cb a , and the differentiation formula (C.2a), we find that these terms 
match the 1st, 4th and 5th terms in (C.3b). This leaves the 3nd term in (C.28) which matches the 2rd 
term in (C.3b) apart from the fact that V is replaced by V — 1. Hence 

V 2 {\UCfC'U) = A(C)A(C") - Ub a fCC'fb a U . (C.29) 



Now consider (C.3b) with 



J= " ) (W = (C.30) 

U V 9i [2]x[2] / 
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We find 

V 2 (UJU)=m4-{f, V } + fA«( 4> * )A„f}b a U. (C.31) 



up 



Then 



A A °) A a = w^°w a -L<p + {<p, r 1 } , (C.32) 

where L is defined in (2.125). Putting this together with (C.29) we have solved (C.25) if 

Lip = w«<p Wa + \CC . (C.33) 

Anti-fermion source 
We now prove that 

S aAB [0 a , A(M B )] = V a ^\ + A (A/a) . (C.34) 

Here, <p a is the solution to (C.25) given in (C.26). The collective coordinate matrix M A (which will be 
seen to depend in a nontrivial way on the original collective coordinates {a,A4 A }) is subject to the 
usual zero mode conditions (4.31a)-(4.31b). 

We now solve (C.34) for ip &A and M A . First of all, from (4.64) and (A.20) 

Z a AB<Pa = -le AB cDUM c fM D U + Z aAB u( (p « )u, (C.35) 

\ u 1 [2] x [2] / 

where <p a is defined in (4.65). 

Va = L 1 {^l aAB M A M B + w & 4>° a w & ) . (C.36) 

As usual in ADHM calculus, some educated guesswork is required. To this end we expand the left-hand 
side of (C.34), using Eqs. (4.29) and (C.35): 

j: aAB [<p a ,A(M B )] =\e ABCD U^M B fb^M c fM D -bJM B VM c fM D 
- M c fM D VM B fb a + M c fM D Vb a fM B \u 

(C 37) 

+ - M°SKV (« I ) + KfM*V (f ° ) 

Here P is the projection operator defined in Eq. (2.51) above. Since d n A = ba n , a comparison of 
Eqs. (C.37) and (C.3a) motivates the ansatz: 

^a = + + (C.38) 

where 

</K = -^abcdUM 3 fA^M fM D U , (C.39a) 

m = \v aAB u{M B fA, ° a ) + (t? ^ a ) A ^ B } c/ ' ( c - 39b ) 

^£4 = ^(0 ^ = -^a, 9„^a = 0. (C.39c) 
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We expect ip^ to account (more or less) for the first four terms on the right-hand side of Eq. (C.37), 

~('2} ~ (3} 

and ip A to account (more or less) for the final four. The presence of ip A , while less obviously motivated 
at this stage, will be needed to ensure that the quantity Ma defined in Eq. (C.34) obeys the zero-mode 
constraints (4.31a)-(4.31b). 



By explicit calculation using Eqs. (C.2a), (4.31a), (4.31b), (C.3a) and (C.39a), one finds: 
V a ^T =Vabcd u{ M B fb a VM c fM D - b a fM B (V - l)M c fM D 
+ M c fM D Vb a fM B - M c fM D (V - l)M B fbAu . 



(C.40) 



Except for the "—1" in 2nd and 4th terms, this reproduces the first four terms of Eq. (C.37), as expected. 
Similarly one calculates: 



V adL ip 







T = -V aAB u\ M B fb a (V + 1) ( f °J - b af M B (V - 1) (** ^ 



+ ( <> y (p + i)b ° fMB - ( s & - wfi- 
- bjAa ft? 1) ***** + MBfAa ft? KDfK w 







(C.41) 



1 »l„ , CAB 



+ 'o" ^) VK,M °-^ °J MBfK 



+ Kf 



+ 



Pa 
ip a 



M u -M B y a 



fba 



- \e AB c D u[b a fM B M c fM D + M c fM D M B fb a }u 



so that the last four lines of Eq. (C.37) are accounted for, as well as the "—1" terms in (C.40). Here 
the final equality follows from the commutator identity (C.32): 



= -\t: aAB M A M B + w a , r 1 } 

implied by Eqs. (2.50), (2.125) and (4.65). Finally, 

'0 



(C.42) 



v a d { T = w{ b a fA & r Q ^ 



= 2U\ b a f 




1 <J'\ 



G\ 



- Q°A a o. 



+ 







g% 



■ 1 da — CLcyQ'a 



(C.43) 



fba }U , 



where to obtain the final equality we used the moves summarized in Eq. (C.5). 

Next we sum the expressions (C.40), (C.41) and (C.43), and compare to the right-hand side of 
Eq. (C.37). By inspection, we confirm the ansatze (C.38), where the Grassmann zero mode matrix has 
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the form 

— aAB U <p a 



M B -M B <p a 



+ 2(° ^)a d -2a d ^. (C.44) 



Up till this point we have yet to solve for Q\. This is accomplished by inserting Ma into the fermionic 
constraints (4.31a)-(4.31b). One finds that Eq. (4.31b) is satisfied automatically by the expression 
(C.44). In contrast, Eq. (4.31a) amounts to 2k 2 independent real linear constraints, which is precisely 
the number required to fix the anti-Hermitian k x k matrices Q\ completely (the explicit form for Q\ 
is not required). 

Super symmetry transformations of the fermion zero modes 

From the supersymmetry transformation of the fermion zero modes 

5A(M A ) = -iE^ip^s , (C.45) 

and the expression (4.64) for <p a in the instanton background, we now extract the transformation of the 
Grassmann collective coordinates 

5M A = 1VZ AB £ad% , $M A = 2iZ AB U B C: , (C.46) 

where 

Ca& = (t? aa ~~ aaVa ' ^ = s * ( o° I) " ■ (c - 47) 

The proof begins by expressing 8 A in terms of a variation of the c-number collective coordinates, as 
in (4.68), and a variation of the Grassmann collective coordinates that must be determined. Up to a 
gauge transformation, one finds 



5A a (M A ) = A a (5M /A ) + i& B u{ - M B fA«M A fb a - Mfb a A«fM B + M B fA«b a fM A 
+ b a fM A A & fM B + M A fM B A"fb a + b a fA & M B fM A \u . 



(C.48) 



In order to derive this we used 

5aa = i£a.AM A , 5a,a = i£,aAM A , 

SU = -{U5U)U -USciafA" , 5U = U(U5U) - A & f5a & U , (C.49) 

5f = -\f{8df l A dl + A«5aa)f. 

Notice that only the last term in (C.48) depends on the variation of the Grassmann collective coordi- 
nates. 

From the supersymmetry transformation rule, the right-hand side of (C.48) must be equated with 
the right-hand side of (C.45) where a assumes its value in the instanton background as in (4.64). Using 
the derivative identity (C.3a) and properties of the S-matrices, one finds 

-%L AB Tt)^alB = - H„nl ^M A fyb,.,fM n + b a fA°M A fM B + M A fM B A°fb a 

- M B fA% a fM A - b a fA*M B fM A - M B fM A A«fb a (C.50) 

-ac^wA'p- ;)^. 
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One can verify using the fermionic ADHM constraints (4.34) that the first six terms in the above are 
equal to the first six terms in (C.48). This means that the variation A a (8fA A ) is then equated with 
the final two terms in (C.50). These terms are not quite in the right form due to the presence of the 
x-dependent A and A terms. However, this can easily be removed by using the tricks (C.5). When this 
has been done one extracts the variations of the Grassmann collective coordinates given in (4.70). 

Variation of the fermion zero modes 

We now prove that under a variation by a collective coordinate 

dA(M) 



where 



+ [%,\(M)\ =pQ li + A(dM/dX»), (C.51) 



da" 



Using (4.29), (2.112) and (C.19), the left-hand side of (C.51) is 

u{^f^{KfM - Mfb a ) + (b af M - Mfb a )A a f^ + ^fb a - bJJ± 

+ + - Mm-,** + • 

Using (C.3a), the right-hand side of (C.51) is 

f 8a 61 - fin 61 - da" - c)M - 8M 

~ U {oxT fbaAafM + M ^^dX, + b ^0xT fM + dX» fK - Kf dxT 
f)n a — — — f)n a f)n a — 1 

+ dx^ fMAafK + bJAaMf dx^i + Mf dx^ AafK r • 



(C.53) 



(C.54) 



The difference of (C.53) and (C.54) is 

H Mf ^ A&+ ^) fb "- b ' f (§-^ +A ^V M } u ■ < a55 > 

But this vanishes by virtue of the constraints (2.109a) satisfied by = daa/dX^. 
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